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DETECTION  PERFORMANCE  OF  GENERALIZED  LIKELIHOOD 
RATIO  PROCESSORS  FOR  RANDOM  SIGNALS  OF  UNKNOWN 
LOCATION,  STRUCTURE,  EXTENT,  AND  STRENGTH 

INTRODUCTION 

Reliable  detection  of  weak  signals  in  noise  is  aggravated 
when  the  signal  has  little  or  no  structure  that  can  be  utilized 
in  processing  the  received  waveform.  Yet,  this  is  a  problem 
frequently  encountered  in  practical  applications  and  which  must 
be  addressed  quantitatively  in  order  that  attainable  performance 
levels  can  be  established  and  realized.  If  significant  gains 
relative  to  simple  energy  detection  are  possible,  this  fact  must 
be  known;  also,  alternative  improved  processing  techniques  must 
be  discovered.  Furthermore,  the  robustness  of  the  alternative 
techniques  to  lack  of  knowledge  of  the  detailed  signal 
characteristics  is  a  critical  issue  that  must  be  addressed  and 
quantified. 

The  problem  we  consider  here  is  couchea  in  the  frequency 
domain,  where  a  known  search  region  of  N  disjoint  bins  contain 
noise  which  is  uniformly  distributed  over  that  entire  frequency 
region.  In  addition,  either  a  signal  is  present  in  M  of  those 
frequency  bins,  or  the  signal  is  absent  from  all  bins.  The 
processing  problem  is  to  maximize  the  detection  probability  when 
signal  is  present,  while  keeping  the  false  alarm  probability 
fixed  at  some  desirable  specified  low  level. 
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To  complicate  the  situation,  the  number  of  occupied  signal 
bins  M  is  unknown;  that  is,  the  extent  of  the  signal  coverage 
(total  bandwidth)  is  not  known  apriori.  Furthermore,  the  M 
occupied  signal  bins  need  not  be  adjacent  in  frequency  or  have 
any  discernible  pattern  in  frequency  space;  that  is,  the  signal 
spectrum  has  no  usable  structure  (such  as  harmonic  lines  or 
contiguous  bins)  that  might  aid  in  signal  processing  and 
detectability. 

Additionally,  the  actual  locations  L  of  the  particular  M 
occupied  signal  bins  (when  signal  is  present)  are  unknown,  except 
that  they  must  occur  somewhere  in  the  total  search  space  of  N 
bins.  It  is  assumed  that  all  of  the  possible  occupancy  patterns 
for  the  set  L  of  M  occupied  bins  are  allowed. 

Finally,  the  actual  average  signal  powers  per  bin  (presumed 
equal  to  a  common  value  S,  for  the  most  part  here),  are  not 
known;  lack  of  this  signal  strength  information  (as  well  as  no 
knowledge  of  M  or  L)  precludes  realization  of  any  optimum 
processing  technique,  which  would  necessarily  rely  on  and  use 
that  information. 

The  absence  of  knowledge  of  these  important  signal  parameters 
(M,  L,  S,  structure)  causes  us  to  adopt  maximum  likelihood 
estimation  procedures  and  their  attendant  generalized  likelihood 
ratio  processors.  Depending  on  the  particular  starting  points  of 
the  analyses,  namely  the  initial  assumptions  about  the  signal 
parameters,  different  forms  of  processors  result.  This  leads  to 
several  classes  of  processors  which  must  be  analyzed,  either 
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analytically  or  by  simulation.  Then,  comparisons  of  these 
processors  are  possible  and  required,  thereby  enabling 
establishment  of  a  baseline  performance  level  in  this  rather 
deleterious  environment.  Some  recent  work  along  this  line  is 
available  in  [1],  where  a  modified  generalized  likelihood  ratio 
processor  was  quantitatively  evaluated  in  terms  of  its  receiver 
operating  characteristics.  Knowledge  of  those  procedures  and 
results  is  presumed  of  the  reader  here. 

The  generalized  likelihood  ratio  processor  is  not  necessarily 
an  optimum  procedure  for  signal  detection.  Rather,  it  is  an  ad 
hoc  procedure  frequently  adopted  for  convenience,  rationality, 
and  for  the  fact  that  it  generally  yields  reasonable  processing 
forms  and  performance  capabilities.  However,  it  must  be  noted 
and  cautioned  there  are  cases  where  the  generalized  likelihood 
ratio  test  can  actually  yield  poor  performance  [2;  page  96]. 

Althoucn  the  present  search  and  detection  problem  has  been 
couched  in  the  frequency  domain,  this  is  done  solely  for  ease  of 
discussion.  The  analyses  and  results  actually  apply  to  any 
search  domain,  such  as  time,  distance,  angle,  or  combinations  of 
these  variables.  For  example,  a  typical  application  could 
require  a  search  in  a  combined  time,  frequency  space,  where  the 
total  search  region  of  N  bins  would  be  composed  of  a  rectangular 
region  of  size  N  =  N^.  However,  it  will  be  necessary  to 

investigate  if  the  fundamental  assumptions  utilized  in  this 
study,  such  as  lack  of  structure,  apply  in  the  particular 
domain(s)  of  interest  to  the  user. 
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This  technical  report  is  the  second  of  a  series  of  four  NUWC 
technical  reports  by  this  author,  covering  the  topics: 

(a)  modified  generalized  likelihood  ratio  processors, 

(b)  generalized  likelihood  ratio  processors, 

(c)  power-law  processors,  and 

(d)  optimum  processing, 

respectively.  Topic  (a)  was  completed  in  [1],  resulting  in  a 
substantial  compilation  of  receiver  operating  characteristics  for 
the  particular  modificatior  considered  there.  Topic  (b)  will  be 
addressed  in  this  report.  The  overall  goal  of  the  extended 
investigation  is  to  determine  classes  of  processors  which  perform 
at  or  near  optimum  levels  of  performance,  and  which  can  be  easily 
realized  and  analyzed,  even  in  these  situations  of  scant 
knowledge  about  the  detailed  signal  characteristics. 
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PROBLEM  DEFINITION 

The  search  space  consists  of  N  (frequency)  bins,  each 
containing  independent  identically-distributed  noises  of  unit 
power.  This  is  presumed  to  be  accomplished  by  an  earlier 
normalization  procedure.  The  number  N  is  under  our  control  and 
is  always  a  known  quantity.  When  signal  is  absent,  hypothesis 
Hq,  the  probability  density  function  of  each  of  the  bin  outputs 
is  completely  known. 

When  signal  is  present,  hypothesis  H^,  the  quantity  M  is  the 
actual  number  of  bins  occupied  by  signal.  When  M  is  unknown,  we 
will  hypothesize  that  M  bins  are  occupied  by  signal.  (If  M  is 
presumed  known,  we  can  take  M  equal  to  that  presumed  known  value 
if  we  please. ) 

The  quantity  L  is  the  actual  set  of  bins  occupied  by  signal, 
when  signal  is  present;  for  example,  if  M  -  4,  then  we  might  have 
the  set  L  =  {2,3,7,29}.  When  L  is  unknown,  we  hypothesize  that 
L  is  the  occupied  set  of  bins,  for  the  previously  hypothesized 
value  of  M.  Thus,  the  size  of  set  L  is  equal  to  M.  (If  L  is 

presumed  known,  we  can  take  L  equal  to  that  presumed  known  set  if 

we  please . ) 

The  quantities  {Sn)  are  the  actual  average  signal  powers  per 
bin  in  occupied  set  L,  when  signal  is  present.  When  these  signal 

powers  {Sn}  are  unknown,  we  hypothesize  signal  powers  {Sn  or  SJ 

for  hypothesized  size  M  and  set  L,  depending  on  whether  we 
presume  these  powers  are  all  different  or  all  equal, 
respectively.  (These  average  signal  powers  |SnJ  will  usually  be 
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unknown  in  practical  applications.) 

For  these  hypothesized  quantities  M,  L,  and  | Sn  or  S)  of  the 
unknown  parameters,  we  can  determine  the  likelihood  ratio  for  a 
given  (random)  observation  { xn } .  Then,  since  the  probability 
density  function  Pg  of  the  observation  (xn)  under  hypothesis  HQ 
is  completely  known,  we  can  maximize  this  likelihood  ratio 
(instead  of  maximizing  density  p1  of  the  observation  }xn)  under 
hypothesis  )  by  variation  of  M,  L,  and  {Sn  or  S},  over  all 
their  allowed  values,  thereby  obtaining  maximum  likelihood 
estimates  M,  L,  and  }Sn  or  S}  of  the  unknown  parameters.  These 
estimates  are  random  variables,  because  they  depend  on  the 
particular  observation  (xnJ.  Comparison  of  this  maximum 
likelihood  ratio  value,  called  the  generalized  likelihood  ratio, 
with  a  fixed  threshold  constitutes  the  generalized  likelihood 
ratio  test.  Simplifications  of  this  test  are  frequently 
possible. 

It  was  mentioned  above  that  when  the  number  of  bins  M 
occupied  by  signal  is  presumed  known,  we  could  set  M  equal  to 
this  presumed  value,  thereby  eliminating  the  search  on  this 
parameter.  However,  a  problem  with  this  approach  is  that,  in 
practical  applications,  the  actual  number  of  occupied  signal 
bins,  M,  may  be  different  from  the  number  M  presumed  during  the 
derivation  of  the  processor.  This  mismatch  between  the  presumed 
and  actual  numbers  can  lead  to  a  degradation  in  performance. 
Quantitative  evaluation  of  this  degradation  is  one  of  the  main 
topics  of  this  study. 
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There  are  three  cases  that  can  obtain  relative  to  the 
available  knowledge  about  the  value  of  M,  the  actual  number  of 
occupied  signal  bins.  In  the  first  case,  M  could  be  known 
exactly;  that  is,  the  total  signal  frequency  extent  is  known 
exactly,  although  the  precise  bin  locations  and  structure  are 
not.  This  might  arise  in  trying  to  intercept  a  frequency-dodging 
diversity-combining  communication  message. 

In  the  second  case,  M  might  be  completely  unknown;  that  is, 
the  signal  frequency  extent  could  be  anything,  from  a  very  narrow 
band  (tonals)  up  to  a  broad  band  of  frequencies.  This  situation 
could  occur  when  there  is  no  apriori  knowledge  about  the  signal 
to  be  detected.  It  could  also  occur  in  the  initial  stages  of 
searching  for  a  general  signal  of  unknown  center  frequency  and 
extent. 

Finally,  in  the  third  case,  size  M  might  be  partially  known. 
Thus,  the  signal  frequency  extent  may  be  known  within  fairly 
broad  limits,  say,  for  example,  from  50  Hertz  to  200  Hertz, 
within  a  total  search  band  of  1000  Hertz.  This  situation  could 
obtain  when  partial  information  is  available  about  the  signal  of 
interest.  The  three  different  cases  will  naturally  lead  to 
different  processors,  each  of  which  makes  use  of  the  information 
available  to  it. 
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DERIVATION  OF  GENERALIZED  LIKELIHOOD  RATIO  TESTS 

PROBABILITY  DENSITY  FUNCTIONS  FOR  KNOWN  SIGNAL  PARAMETER  VALUES 

We  now  specify  the  detailed  character  of  the  probability 
density  functions  Pq  and  p^,  introduced  above,  under  hypotheses 
Hq  and  Hj,  respectively.  In  both  hypotheses,  the  bin  outputs  or 
observations  {*n)  are  taken  as  the  squared  envelopes  of 
outputs  of  (disjoint)  narrowband  filters  subject  to  a  Gau., _  lan 
input  random  process;  alternatively,  the  observations  are  the 
magnitude-squared  outputs  of  a  fast  Fourier  transform  subject  to 
a  Gaussian  input  process.  It  is  assumed  that  these  outputs  {x  l 
are  statistically  independent  of  each  other,  which  is  consistent 
with  a  frequency-disjoint  requirement. 

Since  the  bin  output  noise  has  been  normalized  at  unit  level, 
the  probability  density  function  of  the  n-th  observation  xn  is, 
under  hypothesis  HQ,  an  exponential  of  the  form 

q0(un)  =  exp(-un)  for  un  >  0  ,  1  <  n  <  N  .  (1) 

When  signal  is  present,  with  signal  power  Sn  in  the  n-th  bin,  the 
density  of  xn  is  changed  in  this  signal-present  hypothesis  to 

ql (un)  ■  an  exp(-an  un)  for  un>0,  l£n<N,  (2) 

where  we  have  defined  the  parameter 

a_  =  -g-  <  1  for  1  <  n  <  N  .  (3) 

n  1  -n 

Observe  that  actual  signal  power  Sn  can  also  be  interpreted  as 
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the  actual  signal-to-noise  power  ratio  per  bin,  since  the  noise 
power  per  bin  has  been  normalized  at  unity. 

The  probability  density  function  governing  the  complete 
observation  {xnJ  under  Hq  follows  from  (1)  and  the  statistical 
independence  as 

N 

P0(u1,  . . .  ,uN)  =  TTfexp(-un)  }  .  (4) 

n=l 

On  the  other  hand,  under  H^,  the  pertinent  density  is,  from  (2), 

p1(u1, . . . ,uN)  =  T~Tlan  exp(-an  un) j  r~Tlexp( -un) }  ,  (5) 

neL  nifL 


where  L  is  the  actual  occupied  set  of  signal  bins. 

If  L  and  (Sn)  were  all  known,  the  likelihood  ratio  for 
observation  fxn)  would  be  given  by  random  variable 


where  we  have  defined  weights 


— n  =  1  -  — n  =  rrV 

— n 


for  neL 


Therefore,  the  likelihood  ratio  test  in  this  ideal  case  is  given 
by  the  weighted  linear  sum  comparison  with  fixed  threshold  v: 


x 


n 


> 

< 


v  . 


(8) 
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PROBABILITY  DENSITY  FUNCTIONS  FOR  UNKNOWN  SIGNAL  PARAMETER  VALUES 

Unfortunately,  the  optimum  test  in  (8)  cannot  be  realized  in 
practice,  because  the  occupied  set  L  will  not  be  known  and  the 
signal  strengths  (Sn)  in  the  occupied  bins  will  not  be  known.  To 
circumvent  these  problems,  we  will  now  consider  employing  maximum 
likelihood  estimates  for  the  unknown  parameters. 

When  signal  is  present,  suppose  we  hypothesize  that: 

(1)  M  bins  are  occupied  by  signal, 

(2)  L  is  the  specific  set  of  M  occupied  bins,  and 

(3)  ( Sn }  are  the  signal  powers  per  bin  for  neL.  (9) 

Then,  by  reference  to  (5),  the  probability  density  function 
governing  the  observations  under  Hj  is 

p1(u1, . . . ,uN)  ■  7“Tlan  exp(-an  un) }  TT(exp(-un) }  ,  (10) 

ncL  n/fL 

where  we  defined  parameters 

a  =  }■  c  £  1  for  1  <  n  <  N  .  (11) 

n  l  +  an 

On  the  other  hand,  under  HQ,  we  simply  set  all  signal  powers 
{ Sn }  to  zero,  obtaining  probability  density  function  (4)  again. 
Since  this  latter  function,  pQ,  is  independent  of  all  the 
parameters  hypothesized  above  in  (9),  we  can  obtain  the  maximum 
likelihood  estimates  by  maximizing  the  likelihood  ratio  instead 
of  maximizing  probability  density  function  p^.  The  likelihood 
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ratio,  for  given  random  observation  (xn),  parallels  (6)  and  (7), 
but  using  (10)  now,  namely 


Pi (*i / • • • /*N;  _ _ \ 

“ s  po(*i.  x"’j  •  <i2> 


At  this  point,  in  order  to  make  further  progress,  we  have  to 
consider  two  different  possibilities  for  the  signal  strengths; 
these  are: 


(1)  all  powers  equal:  Sn  =  S  for  neL;  or 

(2)  all  powers  different:  Sn  arbitrary  for  neL.  (13) 

We  will  derive  the  generalized  likelihood  ratios  for  both 
situations,  and  then  extract  the  corresponding  tests  for  two 
different  cases  of  knowledge  about  M,  the  actual  number  of  bins 
occupied  by  signal. 

The  problem  we  are  addressing  here  is  a  noise  in  noise 
problem;  that  is,  the  filter  bank  input  is  a  white  Gaussian 
process  under  Hq,  while  it  is  a  colored  Gaussian  process  under 
Hj.  However,  the  coloring  is  not  known,  nor  is  the  extent  or 
strength  of  the  coloring  known.  This  lack  of  information  leads 
to  complications  in  signal  processing  and  in  the  subsequent 
determination  of  the  detection  capability. 
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ALL  SIGNAL  POWERS  EQUAL 

Here,  we  presume  that  all  the  signal  powers  are  equal  in  the 
occupied  bins,  and  we  set  Sn  =  S  1  0  for  all  n  ir.  hypothesized 
set  L.  Then,  define  parameter 


a 


__1 _ 

1  +  S 


<  1  . 


(14) 


The  likelihood  ratio  in  (12)  reduces  to 

LR  =  aM  exp [ ( 1  -  a)  YU  xn)  #  (15) 

v  neL  ; 

where  we  used  the  fact  that  set  L  is  of  size  M.  An  important 
observation  to  make  immediately  from  (15)  is  that  the  data  { xn } 
will  be  subject  to  addition  of  linear  quantities  in  these 
observations,  regardless  of  how  we  choose  M  and  L,  at  least  in 
this  case  of  presumed  equal  signal  powers. 

The  value  of  parameter  a  that  maximizes  LR  in  (15)  is  random 
variable 

a  =  min  (l  ,M/  YU  x  ]  ,  (16) 

v  neL  ' 

where  we  satisfied  constraint  (14).  The  corresponding  signal 

power  estimate  is,  from  (14), 

S  =  maxfo,^  YU  (xn_1)]  *  (17) 

n  neL  ‘ 

Substitution  of  estimate  (16)  in  (15)  yields  the  generalized 
likelihood  ratio 
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GLR 


(18) 


where  monotonically  increasing  function  g  is  defined  by 

(x  —  1  -  ln(x)  for  x  ^  1] 

g(x)  a  {  \  .  (19) 

[  0  for  x  <  lj 

Now,  we  must  maximize  the  argument  of  the  monotonically 
increasing  function  exp  in  (18)  by  choice  of  set  L  and  size  M. 

If  the  actual  set  size  M  is  known,  then  we  would  take 
hypothesized  value  M  equal  to  known  value  M;  reference  to  (18) 
and  (19)  reveals  that  we  could  then  concentrate  on  maximizing  the 
sum  in  (18),  where  set  L  is  now  restricted  to  be  of  size  M.  But, 
set  L  should  obviously  then  be  taken  to  correspond  to  the  M 
largest  members  of  observation  (xn). 

At  this  point,  it  is  expedient  to  consider  ordering  the 
measured  data  l*nl»  Specifically,  order  the  given  data  l*nl  from 
largest  to  smallest  according  to 

XJ  2  x2  ~  ' *  *  -  XN  *  (20) 


Thus,  xj^  is  the  largest  element  of  fxn),  while  x^  is  the  smallest 

element  of  {x  1. 

n 

We  can  now  easily  achieve  the  desired  maximum  of  the  sum  in 
(18)  in  terms  of  the  ordered  random  variables  {x^}?  namely,  the 
generalized  likelihood  ratio  test  takes  the  form 

M 

IZ  *„  <  v  •  <211 

n=  1 
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That  is,  we  must  order  the  data,  add  the  first  M  ordered 
variables,  and  compare  with  a  fixed  threshold  v.  This  is  the 
generalized  likelihood  ratio  test  for  equal  signal  powers  and 
known  occupancy  size  M. 

This  test  makes  a  great  deal  of  sense.  The  only  way  that  a 
signal  displays  its  presence  in  the  observed  data  {xnJ  is  through 
an  increase  in  the  means  in  M  bins  of  unknown  location;  see  (1) 
and  (2),  where  the  mean  increases  from  1  to  1+S  in  the  n-th  bin, 
when  signal  is  present.  Test  (21)  says  to  consider  the  M  largest 
data  values  and  see  if  their  sample  mean  is  sufficiently  large  to 
declare  that  a  signal  is  present. 

On  the  other  hand,  if  the  actual  set  size  M  is  not  known,  the 
generalized  likelihood  ratio  in  (18)  dictates  the  test 


max 

M11M£M2 


T  *n) )  =  5  < v '  (22> 


where  the  size  of  set  L  is  M,  and  it  is  presumed  that  set  size  M 
is  known  to  be  within  a  range  of  values,  which  must  be 

searched.  Again,  the  ordered  data  must  be  linearly  summed,  then 
averaged,  but  now  subjected  to  monotonic  transformation  g  in 
(19).  Although  there  is  no  obvious  physical  interpretation  of 
test  (22),  it  is  easily  realized  once  the  given  data  (xnJ  has 
been  ordered;  the  search  on  M  itself  is  not  too  time  consuming. 
Operation  (22)  is  the  generalized  likelihood  ratio  test  for  equal 
signal  powers  and  unknown  occupancy  size  M. 
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ALL  SIGNAL  POWERS  ARBITRARY 

Here,  we  presume  that  hypothesized  signal  power  Sn  >  0  for 
all  n  in  hypothesized  set  L  of  size  M.  The  likelihood  ratio  was 
given  in  (12)  for  this  case,  where  parameter  an  was  given  by 
(11).  The  value  of  an  that  maximizes  the  n-th  term  in  (12)  is 
random  variable 

an  =  min{l,l/xn)  for  neL  .  (23) 

The  corresponding  maximum  of  (12)  is  then  the  generalized 
likelihood  ratio 

GLR  =  1—T{exp[g(x  )]}  =  expfo  g(x  )]  /  (24) 

where  function  g  was  defined  in  (19).  It  is  important  to  observe 
here,  in  the  case  of  arbitrary  signal  powers,  that  the  given  data 
{ xn }  is  always  transformed  according  to  nonlinearity  g,  prior  to 
being  summed  over  hypothesized  set  L;  this  is  in  contrast  to  the 
linear  sum  (15)  for  equal  signal  powers. 

Now,  we  must  maximize  the  argument  of  the  exp  function  in 
(24)  by  choice  of  set  L  and  its  size  M.  If  the  actual  set  size  M 
is  known,  then  we  would  take  M  equal  to  M;  then,  using  the 
monotonicity  of  function  g,  set  L  in  (24)  should  obviously  be 
taken  to  correspond  to  the  M  largest  members  of  data  (xn).  This 
leads  to  the  test 

M 

HZ  g(xA>  <  v  •  {25) 

n=l 
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This  is  the  generalized  likelihood  ratio  test  for  arbitrary 
signal  powers  and  known  occupancy  size  M. 

On  the  other  hand,  if  the  actual  set  size  M  is  unknown,  the 
generalized  likelihood  ratio  in  (24)  dictates  the  test 


M. 


n=l 


g(x') 


> 

< 


(26) 


where  it  is  presumed  that  there  is  some  upper  bound,  M2,  on  the 
range  of  values  allowed  for  M.  Processor  (26)  is  the  generalized 
likelihood  ratio  test  for  arbitrary  signal  powers  and  unknown 
occupancy  size  M. 

Both  tests,  (25)  and  (26),  subject  the  ordered  data  {x^j  to 

a  small-signal  suppression  effect,  which  is  inherent  in  function 

2 

g,  prior  to  summation.  That  is,  from  (19),  g(x)  ~  Jj(x  -  1)  for 
x  ~  1.  However,  if  M  or  M2  is  a  small  fraction  of  the  total 
search  size  N,  the  larger  data  values  in  the  ordered  set  will 
dominate  these  tests.  And  since  function  g  in  (19)  is  nearly 
linear  for  larger  arguments,  the  sums  in  (25)  and  (26)  will  be 
essentially  linear  sums  in  this  situation  of  small  M/N  or  M2/N, 
respectively. 
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SUMMARY  OF  ANALYTICAL  RESULTS  FOR  ORDERED  DATA 


The  generalized  likelihood  ratio  test  for  equal  signal  powers 
and  known  occupancy  size  M  was  given  in  (21)  in  the  form 


M 


ez 

n=  1 


x' 

n 


> 

< 


v  , 


(27) 


where  {x^j  is  the  ordered  version  of  given  data  {*nl-  However, 
when  M  is  unknown,  this  test  cannot  be  realized.  Nevertheless, 
it  does  suggest  a  closely  related  alternative;  namely,  for 
hypothesized  size  M,  consider  the  sum  of  the  M  largest  random 
variables,  where  M  is  a  best  guess  or  mid-range  value  of  M.  That 
is,  consider  the  decision  variable  z  given  by 

M  . 

2  s  EH  '  V  •  (28) 

n-1  n  ' 

The  performance  of  this  sum-of-M-largest  processor  will  depend 
upon  both  the  hypothesized  siie  M  as  well  the  actual  size  M  of 
the  occupied  signal  bins,  in  addition  to  the  actual  signal  power 
per  bin,  S.  Evaluation  of  the  receiver  operating  characteristics 
of  this  processor  will  occupy  much  of  the  remaining  effort  here. 

Although  the  original  data  {x^}  is  independent  identically- 
distributed  exponential  random  variables  under  HQ,  the  ordered 
data  (x^J  is  not  independent,  not  identically  distributed,  and 
not  exponential  or  Gaussian.  Thus,  (28)  constitutes  the 
classical  problem  of  finding  the  statistics  of  the  output  of  a 
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digital  filter  subject  to  a  non-Gaussian  input  process  with 
dependent  data  values. 

The  most  important  analytical  problem  of  interest  here  is  in 
finding  the  first-order  statistics  of  random  variable  z  defined 
in  (28).  This  problem  is  solved  in  appendix  A,  in  particular  in 
(A-ll),  where  the  characteristic  function  of  z  is  derived  exactly 
in  closed  form  for  hypothesis  Hq,  namely  independent  identically- 
distributed  exponential  random  variables  for  original  input  l*nl* 
At  the  same  time,  with  the  framework  already  established,  a 
number  of  extensions  have  been  analyzed,  which  could  form  the 
bases  for  future  studies  on  signal  processing  involving  ordered 
data.  These  results  are  summarized  below. 

The  characteristic  function  of  the  p-th  largest  random 
variable  x'  is  given  in  (A-23),  while  the  joint  characteristic 

r 

function  of  the  p-th  and  v-th  largest  random  variables,  x'  and 

r 

x^,  is  given  in  (A-25),  and  the  characteristic  function  of 

difference  x^  -  x^  is  given  in  (A-33).  The  joint  characteristic 

function  of  x'  ,...,x'  is  given  in  (A-34),  and  then  specialized 
^1 

to  the  joint  characteristic  function  of  x£,...,x^  in  (A-38). 

Some  more  general  cases  are  undertaken  in  appendix  B.  The 
joint  characteristic  function  of  x^,...,x'  for  independent  but 
differently  distributed  exponential  random  variables  is  given  in 
(B-12),  and  then  specialized  in  (B-17)  to  the  case  where  a  subset 
of  M  of  these  variables  have  one  density  while  the  remaining  N-M 
random  variables  have  a  different  density.  This  has  obvious 
applications  to  finding  the  statistics  of  sum  variable  z  under 
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hypothesis  rather  than  Hq. 

The  further  specialization  to  all  N  variables  having  the  same 
density  leads  to  (B-18)  for  the  joint  characteristic  function. 
Finally,  the  characteristic  function  of  a  weighted  sum  of  ordered 
random  variables  is  given  by  (B-20).  This  last  investigation  is 
aimed  at  trying  to  improve  the  performance  of  test  (28)  by 
replacing  M  by  N,  and  by  weighting  the  larger  data  points  more 
heavily.  In  particular,  in  appendix  C,  the  maximum  deflection  of 
the  weighted  sum  of  ordered  data  is  solved  in  (C-9)  and  (C-10). 
However,  the  deflection  is  not  a  complete  descriptor  of 
performance,  involving  no  more  than  second-order  moments.  Also, 
(C-10)  requires  extremely  accurate  calculation  of  the  means 
and  {PQnl  i-n  order  to  retain  any  significant  digits  in  the 
optimum  weights  {wn}. 

Finally,  in  appendix  D,  an  investigation  into  distortion  of 
the  data,  after  it  has  been  ordered,  is  conducted.  The 
motivation  behind  this  study  is  again  to  see  if  improved 
detection  performance  can  be  achieved.  For  independent 
identically-distributed  random  variables  with  arbitrary 
probability  density  function  p  and  arbitrary  distortion  function 
h,  the  characteristic  function  of  the  sum  of  the  first  M 
distorted  random  variables  is  obtained  exactly  in  (D-10)  or 
(D-U). 

The  rest  of  appendix  D  is  devoted  to  special  cases.  In 
particular,  the  characteristic  function  of  the  sum  of  the  M 
largest  random  variables  in  a  set  of  N  independent  identically- 
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distributed  exponential  random  variables  is  given  by  (D-23). 

On  the  other  hand,  when  the  distorted  random  variables  are 
squared  prior  to  summation,  the  pertinent  characteristic  function 
is  given  instead  by  (D-26)  and  (D-24). 

When  these  latter  two  problems  are  reworked  for  independent 
identically-distributed  Gaussian  random  variables  rather  than 
exponential  random  variables,  the  characteristic  function  for  the 
linear  sum  is  given  by  (D-28)  and  (D-27),  while  the  corresponding 
result  for  a  sum  of  squares  is  given  in  (D-30)  and  (D-29). 

Finally,  if  the  original  random  variables  {*nJ  are  N 
independent  identically-distributed  chi-squared  random  variables 
with  2K  degrees  of  freedom,  the  characteristic  function  of  the 
sum  of  the  M  largest  random  variables  is  given  by  (D-33). 
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SUM-OF-M-LARGEST  PROCESSOR 

This  processor  is  characterized  in  (28)  as  the  sum  of  the 
first  M  random  variables  of  the  ordered  data  set  {x^},  which  is 
equivalent  to  the  sum  of  the  M  largest  random  variables  of 
original  set  (xnJ  of  size  N.  The  decision  variable  is 

M  s 

Z  -  EZ  <  V  .  (28) 

n=l  ' 


CHARACTERISTIC  FUNCTION  UNDER  HQ 

Under  hypothesis  HQ,  when  noise  alone  is  present,  and  data 
{xnJ  is  composed  of  N  independent  identically-distributed 
exponential  random  variables,  the  exact  characteristic  function 
of  output  z  is  given  by  (A-ll)  as  the  compact  closed  form 


f(U 


(29) 


A  couple  of  checks  on  this  result  are  possible.  First,  for 

— N 

M  =  N,  (29)  reduces  to  (1  -  i£)  ,  which  is  obviously  correct, 

since  z  is  then  simply  the  sum  of  all  the  original  data  |xn).  On 
the  other  hand,  for  M  =  1,  z  is  the  largest  random  variable  in 
set  {*nlr  that  is,  z  =  x^  =  max(xn|,  and  (29)  reduces  to 


f1(l) 


(30) 
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To  verify  this  last  result,  observe  that  the  cumulative 

N 

distribution  function  of  under  Hq  is  [1  -  exp(-v)]  for 

threshold  v  >  0,  since  all  the  exponential  random  variables  jxn| 

must  stay  below  v.  The  corresponding  probability  density 

N- 1 

function  is  N  exp(-u)  [1  -  exp(-u)]  for  u  >  0,  for  which  the 
characteristic  function  (Fourier  transform)  is  just  (30).  Here, 
we  used  result  (A-17)  with  the  identifications  a  =  1,  (3  ■  1,  and 
K  =  N  -  1. 

In  this  special  case  of  M  =  1,  that  is  z  =  max{xn},  the 
exceedance  distribution  function  of  output  z  under  is  also 
immediately  available  in  closed  form.  It  is  given  by 

E*(v)  =  Pr ( z  >  v|H1)  =  1  -  [1  -  exp ( -av ) ]—  [1  -  exp(-v)]N  -,  (31) 

where  we  have  used  (1)  and  (2)  with  common  signal  power  S  and 
parameter  a  s  (1  +  S)-1.  Mis  the  actual  number  of  bins  occupied 
by  signal. 

Returning  now  to  general  results  (28)  and  (29)  for  arbitrary 
N  and  M,  the  mean  of  sum  z  under  Hq  is  readily  found  to  be 

N  M 

z  =  M  +  YIZ  n  *  (32) 

n=M+l  n 

This  result  is  required  when  we  employ  the  numerical  techniques 
in  [3]  for  accurately  and  efficiently  determining  the  exceedance 
distribution  function  directly  from  the  characteristic  function. 
In  this  case  of  hypothesis  Hq,  we  will  obtain  the  false  alarm 
probability  directly  from  characteristic  function  (29),  that  is, 
Pf  =  Pr(z  >  v j  Hq ) ,  for  arbitrary  M  and  N. 
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FALSE  ALARM  PROBABILITY 

For  M  =  1,  that  is,  z  -  max|xnl,  the  false  alarm  probability 
follows  directly  from  (31)  by  setting  signal  power  S  =  0,  that 
is,  a  =  1,  thereby  getting 

Pf(M=l)  =  Pr ( z  >  v | HQ )  =  1  -  [1  -  exp(-v)]N  for  v  >  0  .  (33) 

Since  this  result  is  easily  evaluated,  no  plots  are  presented 
here  for  this  special  case  of  M  =  1,  for  the  sake  of  brevity. 
However,  false  alarm  probability  (33)  and  detection  probability 
(31)  will  be  used  later  to  generate  some  of  the  tabular  results 
that  follow. 

Similarly,  for  M  =  N,  where  z  is  the  sum  of  all  the  random 
variables  {xnJ,  no  analyses  or  plots  are  given  here,  because  they 
have  already  been  given  in  [1;  pages  21  -  22  and  81  -  90].  Thus, 
for  the  most  part,  attention  is  confined  to  1  <  M  <  N  here. 

In  figure  1,  the  false  alarm  probability  P^  for  M  =  2  and  for 
N  =  2  with  k  =  1(1)10  is  plotted  versus  threshold  v  over  a  range 
including  P^  values  down  to  IE-6.  Similar  sets  of  false  alarm 
probability  plots  are  presented  in  figures  2  -  10  as  M  varies 
over  the  values  3,  4,  8,  16,  32,  64,  128,  256,  512,  respectively. 
In  every  case,  the  smallest  value  of  N  considered  is  M,  because 
sum  z  in  (28)  is  only  defined  for  M  <  N.  These  results  are  very 
accurate,  even  at  the  IE-6  level,  because  the  efficient  fast 
Fourier  transform  procedure  in  [3]  was  employed  for  going 
directly  from  exact  characteristic  function  (29)  to  the  false 
alarm  probability,  with  insignificant  aliasing  error. 


25 


TR  10739 


26 


r 


10 


b~  o 


C> 


Figure  2.  False  Alarm 
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Figure  3.  False  Alarm  Probability  for  M  =  4 
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Figure  4.  False  Alarm  Probability  for  M  =  8 
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Figure  6.  False  Alarm  Probability  for  M  =  32 
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Figure  8.  False  Alarm  Probability  for  M  =  128 
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DETECTION  PROBABILITY 

In  all  the  numerical  results  to  follow,  the  total  search 
space  will  be  taken  at  the  fixed  value  N  =  1024.  Then,  actual 
signal  size  M  will  be  allowed  to  vary  over  the  12  values 

M  -  1,  2,  3,  4,  8,  16,  32,  64,  128,  256,  512,  1024,  (34) 

which  allows  us  to  consider  signal  structures  varying  all  the  way 
from  tonals  to  very  broadband  processes.  At  the  same  time, 
hypothesized  signal  size  M  will  vary  over  the  10  values 

M  «  2,  3,  4,  8,  16,  32,  64,  128,  256,  512,  (35) 

consistent  with  the  theme  above  that  there  is  no  need  to  present 
the  results  for  M  =  1  or  M  =  N.  Thus,  we  have  120  cases  to 
consider  for  the  detection  probability  of  the  sum-of-M-largest 
processor.  This  will  allow  us  to  extract  extensive  quantitative 
results  on  the  degradation  suffered  by  mismatching  hypothesized 
value  M  to  actual  (unknown)  value  M. 

Under  hypothesis  H^,  when  signal  is  present  in  M  bins  of  the 
data  (*nJ  leading  to  sum  z  in  (28),  the  characteristic  function 
of  output  z  cannot  be  found  in  any  practically  useful  closed 
form.  This  is  true  even  if  the  signal  powers  are  all  equal  to 
a  common  value  S  in  the  M  occupied  bins.  This  conclusion  is 
based  on  the  analytical  results  presented  in  appendix  B, 
especially  (B-15)  -  (B-17)  coupled  with  (B-3)  -  (B-4).  A 
numerical  example  for  N  =  4  and  M  =  2  in  (B-24)  and  sequel 
illustrates  the  severe  complexity  of  the  general  result. 
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This  inability  to  analytically  obtain  the  characteristic 
function  of  z  under  has  forced  us  to  simulate  the  detection 
probability  results.  An  example  of  the  resultant  receiver 
operating  characteristics  is  given  in  figure  11,  where  8300 
independent  trials  were  used  for  the  ten  different  values  of 
signal  power  S  considered.  The  abscissa  value,  false  alarm 
probability  P^,  of  each  point  on  these  plots  is  exact,  having 
been  obtained  by  means  of  (29).  Therefore,  we  are  able  to 
reliably  carry  these  curves  down  to  the  small  values  for 
around  IE- 6. 

However,  the  ordinate  value,  detection  probability  P^,  has 
jitter  (random  perturbations)  in  it  due  to  the  limited  number  of 
independent  trials,  namely  8300  for  this  particular  example. 

Since  we  are  usually  interested  in  Pd  values  in  the  neighborhood 
of  .5  to  .9,  this  number  of  trials  is  sufficient  to  generate  an 
accurate  receiver  operating  characteristic  in  the  range  of 
interest.  The  jitter  is  most  noticeable  in  the  unimportant  upper 
right-hand  corner  of  the  figure  where  large  P^  values  near  .99 
are  being  estimated,  and  where  P^  is  too  large  to  be  practically 
useful. 

The  totality  of  120  receiver  operating  characteristics 
generated  by  the  cases  listed  in  (34)  and  (35)  are  collected 
together  in  appendix  E.  The  number  of  trials  varies  widely,  from 
5600  to  35000  trials,  depending  upon  the  time  that  happened  to  be 
available  for  the  particular  run.  The  signal  power  per  bin,  S  in 
dB,  always  varies  over  a  range  sufficient  to  cover  the  useful 
values  of  false  alarm  and  detection  probabilities. 
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Figure  11.  Operating  Characteristic  for  N  =  1024,  M  =  1,  M  =  2 
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REQUIRED  SIGNAL-TO-NOISE  RATIO 

The  voluminous  compilation  (120  plots)  of  receiver  operating 
characteristics  in  appendix  E  forces  us  to  condense  this 
information  for  easier  interpretation  and  accessibility.  To 
accomplish  this,  we  define  a  low-quality  operating  point 
Pf  =  IE-3 ,  P^  =  .5  and  a  high-quality  operating  point  Pf  =  IE-6, 
P^  *  .9.  We  then  read  off  the  curves  in  appendix  E  the  values  of 
signal  power  S(dB)  which  are  required  to  realize  these  two  levels 
of  performance.  These  results  are  tabulated  in  tables  1  and  2 
for  the  low-quality  and  high-quality  operating  points, 
respectively,  for  M  and  M  both  ranging  over  the  full  set  of 
values  1,  2,  3,  4,  8,  16,  32,  64,  128,  256,  512,  1024. 

It  is  immediately  seen  that  the  best  value  of  M,  for  minimum 
S,  is  not  necessarily  M,  although  the  discrepancies  are  small. 

For  example,  if  M  =  16,  the  best  value  for  M  is  smaller  than  M, 
being  in  the  range  4  to  8.  On  the  other  hand,  if  H  =  256,  the 
best  M  is  larger  than  M,  namely  512.  The  explanation  for  this 
effect  is  given  in  appendix  F;  it  has  to  do  with  the  fact  that 
the  generalized  likelihood  ratio  processor  is  not  necessarily 
optimum. 

The  results  in  tables  1  and  2,  are  plotted  in  figures  12  and 
13,  respectively,  with  one  modification;  the  ordinates  in  the 
figures  are  the  total  signal  power  required,  M  S  in  dB,  rather 
than  just  the  bin  power,  S.  The  total  quantity  is  more 
meaningful  and  it  condenses  the  range  of  ordinate  values  to  a 
more  manageable  regime. 
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Table  2.  Required  S(dB)  for  Sum-of-M-Largest  Processor,  N  =  1024, 
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Numerous  observations  can  be  made  from  these  two  figures.  We 
confine  the  following  numerical  examples  to  figure  12.  When 
M  *  N  =  1024,  the  optimum  value  of  M  is  also  1024,  although  the 
loss  in  using  M  =  512  is  less  than  .1  dB.  However,  if  we 
continue  to  use  M  =  1024  when  M  has  been  decreased  significantly 
below  1024,  losses  around  9  dB,  relative  to  using  M  =  1,  will  be 
incurred  when  M  =  1  is  reached. 

On  the  other  hand,  suppose  we  attempt  to  always  use  M  =  1 
regardless  of  the  true  value  M.  Although  this  selection  is 
satisfactory  for  M  less  than  32  approximately,  it  will  run  into 
losses  greater  than  10  dB  when  the  actual  value  M  approaches 
1024. 

This  suggests  that  if  nothing  whatsoever  is  known  about  M,  a 
compromise  value  of  M  =  32  might  be  adopted.  The  loss  when  M  is 
actually  1024  is  then  2.5  dB,  whereas  the  loss  when  M  is  actually 
1  is  3.5  dB.  For  intermediate  values  of  M,  say  from  16  to  64, 
the  consistent  use  of  M  «  32  is  nearly  optimum  within  this  class 
of  processors.  Thus,  if  there  is  some  partial  information 
available  about  the  range  of  M  values  to  be  encountered,  and  if 
this  range  is  narrow  enough,  figures  12  and  13  indicate  what  the 
good  choices  of  M  are  and  the  degree  of  loss  caused  by  mismatch. 

Figures  12  and  13  bring  out  one  of  the  significant  drawbacks 
of  the  sum-of-M-largest  processor,  namely  that  a  good  choice  of 
upper  limit  M  in  sum  (28)  cannot  be  made  without  some  knowledge 
about  M,  the  number  of  bins  occupied  by  signal.  It  also 
indicates  that  further  study  into  the  determination  of  a  more 
robust  class  of  processors  is  warranted  and  required. 
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COMPARISON  WITH  MODIFIED  GENERALIZED  LIKELIHOOD  RATIO  PROCESSOR 


The  modified  generalized  likel'hood  ratio  (MGLR )  processor 
[1]  was  derived  on  the  basis  of  an  unknown  number  of  occupied 
signal  bins,  M;  in  fact,  the  average  signal  strength  Sn  was 
estimated  in  each  and  every  one  of  the  N  bins.  However,  during 
data  processing,  if  the  MGLR  processor  uses  the  best  value  of 
its  breakpoint  xQ  for  the  actual  current  value  of  M,  the  MGLR 
processor  has  effectively  been  given  knowledge  of  M.  Also,  since 
the  best  breakpoint  value  of  xQ  is  typically  large  for  small  M/N, 
the  small-input  square-law  suppression  of  nonlinearity 


go(x) 


ln(  x ) 
0 


for  x  2.  x 
for  x  <  x 


(36) 


is  never  encountered,  since  the  nonlinearity  is  virtually  linear 

for  x  >  x^  in  this  situation, 
o 

Thus,  when  the  MGLR  processor  is  effectively  using  knowledge 
of  small  values  M,  it  is  linearly  processing  only  the  largest 
members  of  observation  (*n}»  This  is  exactly  what  the  GLR 
processor  considered  here  does,  except  that  the  number  of 
contributors  to  the  GLR  processor  output  is  always  exactly  M 
(regardless  of  M) ,  whereas  the  MGLR  processor  output  number  of 
contributors  fluctuates,  depending  on  the  actual  input  data  {xnl, 
N,  S,  and  M. 
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This  means  that  we  can  expect  the  best  performers  in  the  MGLR 
class  of  processors  to  have  comparable  performance  to  the  best 
performers  in  the  sum-of-M-largest  class  of  processors.  This  is 
borne  out  by  the  results  in  figures  14  and  15,  which  are 
extracted  from  the  receiver  operating  characteristics  for  the 
MGLR  processor  in  [1].  The  lower  envelopes  of  figures  12  and  14 
are  very  close  over  their  common  range  from  M  =  8  to  H  =  256; 
similarly,  the  lower  envelopes  of  figures  13  and  15  are  fairly 
close  to  each  other. 
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K 

Figure  14.  MGLR  Total  Signal  Power  for  Pf  =  IE-3,  Pd  =  .5 
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SUMMARY 

The  receiver  operating  characteristics  of  the  sum-of-M- 
largest  processor  have  been  determined  for  a  wide  variety  of 
values  of  M,  the  number  of  bins  occupied  by  signal,  and  M,  the 
hypothesized  number  of  occupied  bins,  for  total  search  size 
N  =  1024.  The  false  alarm  probability  was  very  accurately 
evaluated  by  using  the  exact  characteristic  function  of  the 
decision  variable  under  HqJ  the  detection  probability  was 
determined  by  simulations,  each  averaging  about  10000  independent 
trials. 

The  amount  of  loss  associated  with  mismatch  between  M  and  M 
can  be  significant.  However,  for  N  =  1024,  if  one  compromises  on 
and  uses  the  intermediate  value  of  M  -  32,  regardless  of  the  true 
(unknown)  value  of  M,  the  loss  is  no  more  than  about  3  dB.  If 
some  partial  knowledge  about  the  range  of  values  of  M  is 
available,  a  mid-range  choice  of  M  can  be  made,  with  reduced 
losses  in  mismatch.  Quantitative  assessment  of  these  losses  are 
possible  from  the  results  given  here. 

Some  new  results  for  the  characteristic  function  of  a 
weighted  sum  of  ordered  data  have  been  derived,  and  then  used  for 
false  alarm  probability  calculations.  Extensions  to  the 
characteristic  function  of  the  sum  of  distorted  ordered  data,  as 
well  as  to  some  joint  characteristic  functions,  have  also  been 
accomplished,  although  they  were  not  used  to  obtain  the  numerical 
results  here. 
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APPENDIX  A.  CHARACTERISTIC  FUNCTION  OF  SUM  OF  M  LARGEST 
RANDOM  VARIABLES  OF  A  SET  OF  N  EXPONENTIAL  RANDOM  VARIABLES 

Consider  N  independent  identically-distributed  random 
variables  (RVs)  (xnJ  with  common  continuous  probability  density 
function  p  and  cumulative  distribution  function  C.  Then,  the 
probability  that  the  p-th  largest  random  variable  x^  lies  in 
interval  u,u+du  is  given  by  [4;  page  370] 

g  (u)  du  =  Pr{N-p  RVs  <  u;  p-1  RVs  >  u+du;  1  RV  in  du|  = 

=  N  (JJ'J)  C(u)N'^  [1  -  C(u)]p_1  p(u)  du  .  (A-l) 

The  probability  density  function  of  the  p-th  largest  random 

variable  is  g  . 

/J 

More  generally,  consider  two  intervals  centered  on  the  values 
u  and  v,  where  u  >  v,  and  let  integers  p  <  v.  Then,  the 
probability  that  the  p-th  largest  random  variable  x'  lies  in 

H 

u, u+du  and  that  the  v-th  largest  random  variable  x^  lies  in 

v, v+dv  is 

g^v(u,v)  du  dv  = 

=  Prfp-1  RVs  c  (u,®);  v-p-1  RVs  e  (v,u);  N-v  RVs  e  (-°°,v); 

1  RV  e  ( u ,  u+du ) ;  1  RV  e  ( v ,  v+dv )  J  = 

=  N  (N-l )  (”“J)  (^-v1)  [1  "  -  C(V)]V'P_1  x 

x  C(v)N-v  p(u)  p(v)  du  dv  .  (A-2) 
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The  joint  probability  density  function  of  the  jj-th  and  v-th 
largest  random  variables  of  set  jxn)  is 

Continuing  in  a  similar  fashion,  the  joint  probability 
density  function  of  the  M  largest  random  variables,  out  of  the 
set  of  N  independent  identically-distributed  random  variables 
{xnJ,  is,  for  values  Uj  >  u2  >  •••  >  uM, 

N  M  M 

g(Ul,u2,...,uM)  =  N{N-1)**-{N-M+1)  C(um)n~”  TT  P(um)  .  (A-3) 

m=  1 

We  are  interested  in  the  characteristic  function  of  the  sum  s  of 
the  M  largest  random  variables  in  set  {*nl*  Notice  that  the 
components  {x^}  of  this  sum  consist  of  M  statistical ly-dependent 
non-Gaussian  non- identically  distributed  random  variables.  The 
desired  characteristic  function  is  given  by 


fs(£)  =  exp(iSs)  ■  exp(i£[x'  +  •••  +  x^])  = 

=  dui  du2 '  ’  *  duM  g(uru2r  ••  "V  expfitluj  +  •••  +  uM])  = 


u, 


u 


M-l 


=  J  dUj  expfiSuj)  |  du2  exp(ilu2)***  J  duM  exp(i£uM)  g(u1,...,uM) 


-  |  duM  exp(i?;uM)  •  *  •  J  du2  exp(i£u2)  J  du1  exp(i£ux)  g(ux , . . .  ,uM) . 

(A-4) 


u3  u2 


The  last  form  in  (A-4)  is  more  useful,  because  of  the  way  that 
cumulative  C(uM)  appears  in  the  density  g(u^, . . . ,uM) ;  namely,  the 
integration  on  C(uM)  is  deferred  to  the  last  integral  in  (A-4). 
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In  general,  substitution  of  ( A—  3 )  in  (A-4)  results  in  an 
intractable  multiple  integral.  However,  for  the  case  of  an 
exponential  probability  density  function  for  original  random 
variables  {xnj, 


p(u)  =  exp(-u)  for  u  >  0  ,  (A-5) 

all  the  manipulations  can  be  carried  out  in  closed  form.  To 
demonstrate  this,  substitute  (A-3)  and  (A-5)  into  (A-4)  to  obtain 


fs(S)  ■  F  J  duM  exp(-zuM)  [1  -  exp(-uM))N_M  J  du^j  exp(-zuM1)  x 


u 


M 


x  •••f  du2  exp(-zu2)  J  dUj^  exp(-zu^)  , 


(A- 6 ) 


u. 


u. 


where  we  have  defined 


F  =  N(N-l) • • • (N-M+l)  ,  Z  =  1  —  U  .  ( A-7 ) 

Denote  the  integral  on  general  term  dum  in  (A-6  as  1^. 

Then,  it  is  readily  verified  that 


exp(-zu2)  exp(-2zu3)  exp( - (M-l ) zuM) 

11  "  i  '  12  TT- .  Ifl-1  =  ,M-1 )  !  z"-1  • 

(A- 8 ) 

Utilization  of  the  last  result  in  (A-8)  yields,  from  (A-6), 

00 

f  (S)  =  - - — o  [  duM  [1  -  exp(-u  )]N-M  exp(-MzuM)  .  (A-9) 

*  (M-l) !  zn  1  J  n  " 
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At  this  point,  we  use  the  integral  result 


J  du  [ 1  -  exp(-u)]K  exp(-cu)  =Jdt(l-t)Ktc* 


r|r(ctK^i) 1  -  T5T^77  £or  K  inte9er  '  Re(c)  ”  0  '  (A'10) 


which  is  available  from  [5;  8.380  1  and  8.384  1].  Then,  ( A— 9 ) 
takes  on  the  final  form  for  the  characteristic  function  of  the 
sum  s  of  the  M  largest  random  variables  of  set  (*nl/  namely 


£s<£> 


(N-M) ! 


(M-l)l  zM-1  *MZ)N-M+1 


(1  -  U) 


M-l 


N 


n=M 


1  -  it 


5) 


( A- 1 1 ) 


where  we  have  used  (A-7)  to  simplify  the  end  result.  This 
compact  closed  form  expression  for  £  (t)  is  readily  numerically 
evaluated  and  is  well  suited  to  the  numerical  methods  in  [3] 
for  accurately  and  efficiently  determining  the  false  alarm 
probability  of  sum  random  variable  s.  (More  generally,  the 
characteristic  function  of  an  arbitrary  weighted  sum  of  ordered 
data  is  accomplished  in  (B-20)  -  (B-21)). 

It  is  also  interesting  to  observe  that  (A-ll)  corresponds  to 
the  characteristic  function  of  a  sum  of  N  independent  exponential 
random  variables  { yn }  with  non-identical  probability  density 
functions 


'1 

for 

1  < 

n  <  M 

Pn(u)  *  an  exP<-anu>  with  an  =  ‘ 

n 

,M 

for 

M+l 

£  n  £  N 
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The  first  H  random  variables  of  auxiliary  set  |ynl  have  mean 
values  1,  while  the  remainder  have  decreasing  mean  values  M/n  for 
M+l  i  n  i  N. 

The  cumulants  of  the  sum  random  variable  s  follow  upon 
expansion  of  the  natural  logarithm  of  (A-ll)  in  a  power  series  in 
i£;  the  k-th  cumulant  is 

X_(k)  =  (k-1 ) !  (m  +  Mk  JZ2  -t]  for  k  il  .  (A-13) 

s  '  n=M+l  nK; 

In  particular,  the  mean  and  variance  of  sum  s  are 

N  .  N 

X_(l)  «  M  +  M  JZZ  £  '  X-(2 )  =  M  +  YU  -4  *  (A- 14  ) 

n=M+l  5  n=M+l  n 

The  mean  of  s  tends  logarithmically  to  infinity  as  N  -*  ®,  while 
the  variance  and  all  the  other  higher-order  cumulants  of  s 
saturate  at  finite  values  as  N  increases. 

The  characteristic  function  fs(S)  of  sum  s  in  (A-ll)  can  be 
written  in  the  form 
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The  latter  relation  can  be  verified  directly  by  using  result 
(A- 10 )  with  K  ■  N-M-l  and  c  =  M+l-i£M.  However,  the  convolution 
of  the  two  density  functions  pfl  and  in  (A- 16)  has  not  led  to 
any  useful  expressions  for  the  probability  density  function  or 
exceedance  distribution  function  of  sum  s. 

More  generally,  the  following  are  a  Fourier  transform  pair: 


Pc(u) 


K ! 


'a' 


K 


K+l 


exp(-au)  [1  -  exp(-pu)]  for  u  >  0 


f 


c 


(1) 


K  integer  ,  a  >  0  ,  0  >  0  .  (A-17) 


The  k-th  cumulant  of  this  pair  is 

x  (k)  =  (k-l)l  ZZ  - - - k  for  k  “  1  '  (A_18) 

c  n=0  (a  +  0n ) 

which  is  a  finite  sum  of  positive  terms.  This  means  that  the 
corresponding  k-th  moment. 


PC(k)  =  J  du  uK  Pc(u) 
0 


(> 


e~^u  K  ( A- 19 ) 


can  be  expressed  as  a  finite  collection  of  positive  terms. 
Specifically,  the  k-th  moment  can  be  easily  and  accurately 
obtained,  to  high  order,  by  the  recursion  [6;  page  94,  (A-6)] 


Pr(*)  =  ZZI  [\l]  Xc(k-m)  fj  (m)  for  k  >  1  ,  v  (0)  =  1  ,  (A-20) 
c  m=0  v  m  ;  c 


which  involves  only  positive  terms.  For  example, 
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Aic(l)  »  XC(D  ,  Pc(2)  =  Xc(2)  +  Xc(  1 5  Pc(  1)  =  Xc(2)  +  *1(1)  , 

Pc(3)  =  Xc(3)  +  2  xc(2)  pc(l)  +  xc(D  Pc(2)  = 

“  Xc(3)  +  3  xc(2)  XC(D  +  Xc(!)  •  (A-21) 

By  letting  t  =  exp(-u)  in  (A- 19),  we  have  the  ability  to 
evaluate  the  following  integral  in  terms  of  finite  sums  of 
positive  quantities: 

*c<k>  -  KT  (?)_..  I  dt  Jk  t0-1  (l  -  tP)K  .  ( A-22 ) 

K+l  Q 

The  larger  k  values  will  require  use  of  positive  recursion 
(A-20).  Parameters  k  and  K  are  integers,  while  a  and  p  are 
positive  real. 

As  an  application  of  the  result  in  (A-17),  the  characteristic 
function  of  the  p-th  largest  random  variable  in  (A-l),  for 
probability  density  function  (A-5),  is  immediately  found  to  be 


which  contains  only  N+l-p  factors.  Here,  1  <  p  <  N.  The  k-th 
cumulant  of  the  p-th  largest  random  variable  is  therefore 

N  . 

Xu(k)  =  (k-l)l  YU  -r  for  k  2  1  .  (A-24) 

p  n=p  nK 

In  a  similar  vein,  the  joint  characteristic  function  of  the 
p-th  largest  random  variable  x'  and  the  v-th  largest  random 
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variable  x^,  with  p  <  v,  can  be  found  from  (A-2)  and  (A-5)  in 
closed  form 

*  exp{Ux^  +  iCx^)  *  JJ  du  dv  gpN)(u,v)  exp(Uu  +  i£.v) 
00  00 

-  J  dv  exp(iCv)  J  du  exp(Uu)  g  (u,v)  - 


~lU  "  Tr)  "TTf1  "  f  for  1  i  fj  <  v  i  N  .  ( A-25 ) 

=  n  '  =\i  *  I 


This  form  contains  a  total  of  N+1-/J  factors,  no  matter  what  value 
the  integer  v  has.  If  C  »  0,  (A-25)  reduces  to  (A-23),  as 
expected.  On  the  other  hand,  if  l  -  0,  then  (A-25)  reduces  to 


_  14 

n 


( A-26 ) 


which  has  a  form  identical  to  (A-23),  again  as  expected. 

The  joint  cumulants  of  the  y-th  largest  random  variable  x^ 
and  the  v-th  largest  random  variable  x^  of  set  {xn),  with  p  <  v, 
can  be  found  from  the  expansion  of  (A-25)  according  to 


In  f  U,t)  =  EZ  i[(U)k  ZZ  4  +  (H  +  iC)k  d  41  *  (A"27> 
k=l  M  n mu  nK  n=v  nKJ 


Thus,  we  have 


mean { x ' }  =  l _ .  -  ,  mean{x' )  =  > 

H  n-jj  n=v 


( A-28 ) 
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N 

var  l*'  J  =  HZ  ~ 

**  n=/j  n 


N 


varjx^J 


covi  xIj'K 


n=v  n 


(A-29) 


Observe  that  the  covariance  between  x^  and  x^  is  equal  to  the 
variance  of  x'.  Also,  the  covariance  coefficient  between  x'  and 

\>  (j 

x;  is 


1  £  p  <  v  £  N  . 


( A-30 ) 


Thus,  for  large  N,  the  two  largest  random  variables  of  a  set  of  N 

exponential  random  variables  have  a  covariance  coefficient  of 
2  k 

(n  -6)  vn  =  .626,  while  the  largest  and  smallest  random  variables 
have  a  covariance  coefficient  of  /6/(nN)  =  .780/N. 

More  generally,  the  k,<n  joint  cumulant  of  x^  and  x^  follows 
from  (h-21 )  as 


V>(k'm) 


where 


( k—  1 )  !  S^(k)  for  m  =  0  ,  k  i.  1 
(k+m-l)i  S^(k+ra)  for  m  £  1 


(A-31) 


Sj(k) 


n-j 


for  1  <  j  £  N  , 


1  £  k  . 


(A-32) 


The  characteristic  function  of  the  difference  between  the 
/j-th  largest  and  the  v-th  largest  random  variables  is  obtained 
directly  from  (A-25)  by  setting  C  =  -t: 


(Z.-Z) 


=  exptU(x^  -  x^)]  = 


( A-33 ) 
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which  has  v-p  factors.  Thus,  the  difference  of  adjacent  ordered 
random  variables,  that  is,  v  =  p  +  1,  has  an  exponential 
probability  density  function  with  mean  value  1/p. 


Most  generally,  it  is  shown  in  appendix  B  that  the  M-th  order 

joint  characteristic  function  of  the  pj-th  largest  random 

variable  x '  ,  the  p0-th  largest  random  variable  x'  ,...,  and  the 
^1  z  V2 

pM-th  largest  random  variable  x'  of  original  set  |x  },  for 


!  i  p2  <  p2 


<  pM  i  N,  is 


where  we  have  defined 


$m  =  £»1  +  £>2  +  •••  +  Cm  for  1  <  m  <  M  .  (A-36) 

The  characteristic  function  in  (A-35)  reduces  to  first-order 
result  ( A—  1 1 )  when  we  set  =  C2  =  ***  =  CM  =  lr  and  set  pj  =  1, 
P2  =  2 , . . . ,  pM  =  M.  A  numerical  confirmation  of  general  result 
(A-35)  was  obtained  by  simulation  for  the  fourth-order  example 

M  *  4,  N  =  19,  p^  =3,  p2  =  6,  p^  =  11,  p^  =  16, 

tj  =  .31,  C2  =  -.53,  £,3  =  .97,  £,4  =  .77.  (A-37) 
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The  exact  result  from  (A-35)  is  .71239  +  i  .65022,  whereas  the 
simulation  yielded  estimate  .71246  +  i  .65015,  based  upon 
5,000,000  independent  trials  of  ensemble  average  (A-34). 

An  application  of  (A-35)  to  the  M  largest  random  variables  of 
set  Jxnl  is  afforded  by  choosing  =  m  for  1  <  m  <  M,  to  obtain 
the  M-th  order  joint  characteristic  function 


If  we  set  Cj  =  C2  =  ***  =  CM  =  S,  and  use  (A-36),  this  reduces  to 
(A-U). 

The  results  above  can  be  used  to  find  statistics  of  some 
nonlinear  transformations  of  ordered  data  (x'j.  For  example,  the 
sum  of  the  M  largest  squares  is  given  by 


z 


M 

c  * 

p=i 


,2 

P  * 


From  ( A-24 ) ,  we  know  that 


(A-39) 


"  X/J(D 


Var(x^)  =  Xp(2) 


Therefore,  the  mean  of  z  is  given  by 


( A-40 ) 
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f  f  N 

\\ 2  » 

A 

EZ 

+  EZ 

4 

\(n=n  1 

n=p 

nZ) 

N  N  .  M-l  ,  -  x 

■  rZ  K5  +  M  —  +  LI  „  -  "  V,  *  Y2  .  (A-411 

n,m=l  n=M  n  n=l 


after  considerable  manipulation,  where 

N  i 

rk  $  i  +  nz  4 

k  n=M+l  nk 


(A-42) 
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APPENDIX  B.  JOINT  CHARACTERISTIC  FUNCTION 
OF  N  ORDERED  INDEPENDENT  RANDOM  VARIABLES 

Let  { xn }  be  a  set  of  N  independent  random  variables,  where 
the  probability  density  function  of  the  n-th  random  variable  xn 
is  p(u,n);  the  variables  need  not  be  identically  distributed. 
Order  these  random  variables  into  the  new  set  {x^j,  where 
X1  >  x2  >  ***  >  XN‘  We  evaluate  the  N-th  order  joint 

characteristic  function  of  this  ordered  set,  namely 

f  {llf . . .  ,SN)  =  exp(  i^x'  +  *  *  •  +  /  (B-l) 

for  arbitrary  values  of  l£nl/  when  probability  density  functions 
(p(u,n) )  of  (xn)  are  exponential.  That  is,  we  will  consider 

p(u,n)  =  a(n)  exp[-u  a(n)]  for  u>0,  1  <  n  <  N  .  (B-2) 

The  ordered  random  variables  (x^)  are  highly  statistically 
dependent  on  each  other  and  are  distinctly  non-Gauss ian. 

Observe  that  knowledge  of  (B-l)  allows  ready  evaluation  of 
the  characteristic  function  of  any  weighted  sum  of  the  ordered 
random  variables,  by  simply  choosing  £  =  l  wn.  That  is,  for  the 

weighted  sum  of  the  ordered  variables, 

N 

s  =  )  |  w  x '  ,  ( B-3 ) 

n=  1 

we  have  characteristic  function 


£s(l)  =  exp(ils)  =  exp 


(it  IZ  «n  *i)  -  t(t", 


t«N) 


( B-4  ) 
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Thus,  for  example,  by  choosing  only  the  first  M  weights  of 
sequence  (wn)  nonzero,  we  can  investigate  the  M  largest  random 
variables  of  original  set  (xnJ  of  size  N.  Alternatively,  for 
example,  by  taking  only  and  nonzero  in  (B-l),  we  can 
investigate  the  combination  of  the  third-largest  and  fifth- 
largest  random  variables  in  set  jxn). 

Before  we  begin  the  derivation  of  (B-l),  we  must  consider  all 
possible  permutations  of  the  integers  1,2,...,N.  For  N  *  3  for 
example,  there  are  the  Ni  =  6  possibilities  123,  132,  213,  231, 
312,  and  321.  We  label  these  six  sequences  according  to 


11  “ 

k12  "  2' 

kl  3  3; 

ii 

<N 

k22 

=  3, 

k23  *  2; 

31  =  2' 

k32  =  1 ' 

k33  =  3? 

k4 1  “  2' 

k42 

-  3, 

k4  3  "  1? 

51  =  3' 

k52  =  lf 

k53  =  2; 

k61  =  3' 

k62 

=  2, 

k63  = 

general 

,  the  j-th 

permutation 

out  of  the 

total 

of  N 

!  possible 

permutations  is  indicated  by  the  sequence  k^,  kj2,...,kjN*  Thus, 
we  have  a  matrix  of  integers  (kjn)  for  1  <  j  <  N 1 ,  1  <  n  <  N. 

The  joint  probability  density  function  of  the  ordered  set  of 


N  random  variables  Jx'  }  can  now  be  written  in  the  form 

n 

N!  N 

g(u1,u2, . . .  ,uN)  =  YZZ  FT  P(un,kin>  for  U1  >  u2  >  *“  >  un  ' 

j  =  1  n=l  J 

(E-5) 

and  zero  otherwise.  The  N-th  order  joint  characteristic  function 
of  the  ordered  random  variables  (x^)  is  then  given  by 


f(51, . . . ,SN)  =  exp(iC1x'  +  •••  +  ^nxN )  = 

J---J  du  x  •  •  •  duN  g(u1,...,uN)  exp(i?;1u1  +  •••  +  UNuN) 
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NJ  r  r 

C  J  dUN  P<UN'kjN>  eXP<UNUN>  J  dUN- 1  P<uH-l'kj#N-l)  X 

UN 


X  exp(ii;N_1uN_1)  X  ...  x  J  du2  P(u2 ,  kj  2 )  exp(U2u2)  * 


x  J  duj  p(u1,kj1)  expti^jUj)  .  (B-6) 

U2 

The  result  in  (B-6)  is  general,  holding  for  any  probability 
density  functions  {p(u,n)|.  However,  one  of  the  few  cases  where 
it  can  actually  be  evaluated  is  for  the  exponential  densities 
given  in  (B-2).  Substitution  yields 


f(?,M 


'V 


N ! 

=  XZ  I  duN  a(kJN)  exp(-uN  zjN) 

J  ^  A 


|  dlVl  a(kj,M-l»  exp(-uN-l  2 j  ,N-l) 


u 


N 


x  J  du2  a(kj2 )  exp[-u2  Zj2)  J  duj  a(k^)  exp(-u1  z^J  ,  (B-7) 


u- 


u. 


where  we  have  defined  complex  quantities 


zjn  ■  £(kjn)  "  i^n  for  15n£N,  1  <  j  <  N I 


( B-8  ) 


The  factor  involving  the  product  of  coefficients  is  given  by 
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N 

a<kjN)  a(kj  N_i)  x  ...  x  a{  k  j  2 )  afk.^)  =  J~J  a(n)  m  A  ,  (B-9) 

regardless  of  the  value  of  j,  the  particular  permutation. 

We  denote  the  integral  on  un  in  (B-7)  by  *jn/  and  let 

0jn  *  zjl  +  z j 2  +  ***  +  2 jn  for  1  *  n  <  N  ,  1  <  j  1  N!  .  (B-10) 


Then,  excluding  factor  a(kjn),  we  find,  in  order. 


exp(-u2  6^) 

rji  =  eji 


exp(-u3  S12) 

-  en  ej2 


exp(-uN  e.  „  j)  J 

1j.— t  - ' 9jt  v-e-7^7 


e 


jN 


(B-ll) 


The  use  of  (B-9)  and  (B-ll)  in  (B-7)  finally  yields  the  joint 

characteristic  function  of  the  ordered  random  variables  (x'|  in 

n 

the  form 


N! 


jj)  -  a 


j  =  l  9jl  6 j 2 


0jN 


(B-12) 


where  { 0 j n }  are  given  by  (B-10)  and  (B-8).  By  combining  these 
latter  expressions,  we  find,  for  1  <  j  <  N ! ,  1  £  n  <  N, 


0  •  =  h  .  -  i\l/ 

jn  'jn  rn 


'in 


n 

p=i 


a  (  k  .  ) 
-  DP' 


xb  = 
Yn 


n 

p~l 


(B-13 ) 


The  major  problem  with  result  (B-12)  is  the  impossibility  of 
evaluating  all  N!  terms,  especially  for  large  N.  Even  if  we  are 
interested  only  in  the  sum  of  the  first  M  terms  of  ordered 
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sequence  (x^j,  that  is,  the  sum  of  the  M  largest  random  variables 
of  original  sequence  (xn),  that  only  allows  for  the 
simplification  of  { J  in  (B-13)  to  the  form 

f n£  for  1  1  n  <  M| 

*n  -  \  '  CB-14) 

ImJ;  for  M+l  i  n  1  Nj 

where  we  have  let  £n  =  £  for  1  -  n  £  M,  and  zero  otherwise,  in 
(B-l).  There  is  no  accompanying  simplification  in  the  tedious 
calculation  of  thjnJ  in  (B-13). 

If  the  exponential  probability  density  functions  in  (B-2)  are 
characterized  by  equal  signal  components  in  the  first  M  terms 
(without  loss  of  generality)  and  noise  otherwise,  then  we  have 
the  special  case 

fa  for  1  <  n  i  M) 

a(n)  =  \  ~\  .  ( B-l 5 ) 

[ 1  otherwise  J 

Now,  consider  the  first  n  terms  of  the  j-th  permutation,  namely 

k. k. k-  .  Let  the  number  of  times  that  any  of  the 
j  1 '  j2  jn 

numbers  1,  2,...,  M  occurs  in  these  first  n  locations  of 
permutation  j  be  denoted  by  L(j,n).  Then,  from  (B-13)  and 
(B-15),  we  find 

h j n  =  a  L( j ,n)  +  [n  -  L(j,n)]  =  b  L(j,n)  +  n  ,  (B-16) 

where  we  have  defined  b  =  a  -  1.  The  joint  characteristic 
function  follows  from  (B-12)  as 


'  ) 


M  N!  N 

-  i'cn 


j^T  A=1  £  L{j'n)  +  n  -  i+n 


( B-17 ) 
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where  we  also  used  ( B—  9 )  to  determine  A.  This  result  requires 
determination  of  all  the  integers  {L(j,n)}  for  its  evaluation, 
which  appears  to  be  a  formidable  task,  in  general.  Furthermore, 
this  is  the  simpler  case  of  equal  signal  components  in  the  first 
M  terms. 

The  special  case  of  no  signal  in  any  bins  corresponds  to 
a  =  1  in  ( B— 15),  thereby  giving  b  =  0.  This  causes  all  uhe 
dependence  on  {L(j,n)J  in  (B-17)  to  disappear,  thereby  yielding 


where  sequence  (^n)  is  given  by  (B-13).  This  is  the  joint 

characteristic  function  (B-l)  of  ordered  sequence  {x^},  when  all 

the  original  random  variables  (xn)  have  the  common  probability 

density  function  exp(-u)  for  u  >  0. 

As  an  application  of  (B-18),  consider  that  all  the  |£nl  are 

zero,  except  that  £  ^  0,  £  *  0,...,  £  ^  0.  Then,  (B-13) 

bx  V2  PM 

yields 


IV  =  10 


( B— 19) 


in  which  case  (B-18)  reduces  to  (A-34)  -  (A-36). 

*We  can  also  use  (B-18)  to  determine  the  characteristic 
function  of  the  weighted  sum  s  of  (x^j  defined  in  (B-3),  for  the 
case  of  no  signal  present.  Namely,  from  (B-4)  and  (B-13)  with 
the  identification  of  £  as  £  w  ,  there  follows  the 

r  kr 

characteristic  function  of  the  noise  1 y  sum  s  as 
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where  the  n-th  coefficient 


(B-20) 


Wn  =  n  k=T  Wk 


for  1  £  n  £  N 


( B-2 1 ) 


can  be  interpreted  as  the  average  of  the  first  n  weights.  The 
k-th  cumulant  of  noise-only  sum  s  follows  from  (B-20)  as 


(k)  =  ( k-1 ) ! 


for  k  >  1 


( B-22 ) 


For  the  special  case  of  all  the  weights  fwn)  equal  to  1,  then 

-N 

Wn  *=  1  for  all  n,  and  (B-20)  reduces  to  (1  -  i£)  as  expected, 
since  the  total  sum  of  the  ordered  data  is  equal  to  the  sum  of 
the  original  data,  and  there  is  no  signal  present.  On  the  other 
hand,  if  only  the  first  M  weights  (wn)  are  1,  while  the  remainder 
are  zero,  then  Wn  =  1  for  1  £  n  £  M,  and  Wn  =  M/n  for  M  <  n  £  N. 
Then,  (B-20)  immediately  reduces  to  (A-ll),  as  expected  for  this 
noise-only  case. 

As  a  partial  check  on  general  result  ( B— 17),  let  M  =  N,  which 
means  that  all  N  bins  contain  equal  signal  components  (when 
signal  is  present).  Then,  by  the  definition  under  (B-15),  we 
have  L(j,n)  =  n,  independent  of  permutation  number  j.  The 
characteristic  function  in  (B-17)  then  simplifies  to 
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which  is  an  obvious  generalization  of  (B-18). 

As  an  illustration  of  the  type  of  analysis  required  to 
simplify  (B-17)  for  equal  signal  components,  consider  the  example 
of  N  a  4,  M  =  2.  Of  the  Ni  =  24  possible  sequences  that  jL(j,n)j 
can  take  on,  there  are  only  6  different  kinds  that  can  occur; 
they  are 

1222,  1122,  1112,  0122,  0112,  0012. 

Furthermore,  each  type  occurs  exactly  4  times.  Expression 
(B-17)  then  specializes  to 

f(Ej,t2,t3,t4)  =  4a2(i-  +  •••  ♦  y  , 

where 

D:  =  (b+l-i^1) (2b+2-i^2) (2b+3-i^3) (2b+4-i^4 )  , 

D2  =  (b+l-i^M  b+2-i^2)(2b+3-ix|/3)(2b+4-ivj;4)  , 

D3  =  (b+l-itJ^M  b+2-i»^2 )  (  b+3-iv);3)(2b+4-i^4)  , 

D4  -  (  l-i^M  b+2-ivP2)(2b+3-i^3)(2b+4-ii|;4)  , 

D5  =  (  l-itl^M  b+2-ixjy2)(  b+3-i4/3)(2b+4-i^4)  , 

D6  =  {  1-i^M  2-i^2)(  b+3-i\J/3 )  ( 2b+4-iiJ'4  )  . 

For  the  numerical  example  of  =  .31,  S2  =  -.53,  S3  =  .97, 

=  .77,  a  =  .71,  the  exact  answer  from  (B-24)  -  (B-25)  is 
.474694  +  i  .653188,  while  a  simulation  result  based  on 
27,000,000  statistically  independent  trials  yielded  estimate 
.474647  +  i  .653168. 


(B-24) 


(B-25  ) 
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Other  numerical  examples  have  indicated  that,  in  general, 
there  are  binomial  coefficient  (N|M)  different  possible  sequences 
for  (L(j,n)}.  Although  this  integer  (N|M)  can  be  significantly 
less  than  Ni,  it  is  still  much  too  large  for  most  practical 
situations  where  N  is  generally  much  larger  than  1.  Application 
of  general  results  (B-12)  or  (B-17)  for  the  joint  characteristic 
function  of  the  ordered  data  appears  to  be  limited  to  very 
special  cases. 
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APPENDIX  C.  MAXIMUM  DEFLECTION  OF  WEIGHTED  SUM  OF  ORDERED  DATA 


The  random  variable  of  interest  here  is 


w  x'  , 
n  n  ' 


(C-l) 


where  original  data  (xn)  is  composed  of  independent  and 
identically-distributed  exponential  random  variables.  It  has 
been  ordered  into  descending  set  {x^}.  Under  HQ,  the 
characteristic  function  of  z  is,  from  (B-20)  and  (B-21), 


N  /  x-1  ,  n 

V*)  *  TT[l  -  «WnJ  ,  Wn  =  iCwp  for  1  £  n  £  N  .  (C-2 


If  we  were  given  coefficients  { Wn } ,  we  can  solve  for  weights  Jwn) 
according  to  (with  Wn  s  0) 


w  =  n  W  -  (n-1)  W  .  for  1  <  n  <  N 
n  n  n-1 


( C-3 ) 


The  mean  and  variance  of  z  under  Hn  are,  directly  from  (C-2), 


N  7  N  N  . 

"0  ■  LZ  wn  '  °0  -  C  w„  '  X0(k>  -  EZ  «*  • 

n=i  n= l  n=l 


( C-4  ) 


Now,  let  the  means  of  the  n-th  random  variable  under 
and  Hq  be  and  l»Qn'  respectively.  Define 


=  "in  -  "On  for  1  <  n  £  N  . 


(C-5 ) 


Then,  the  difference  of  the  mean  outputs  of  z  is,  using  (C-3), 
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^  V^ln  "  *W  *  2_-  wn  An  = 
n= I  n=l 


CZ  4n(n  Wn  -  (n-1)  Wn.j)  -  \ 


)  ,  (C-6) 


where  AN+1  -  0.  Also,  the  deflection  of  z  is 


(Az)2 

2 


i£t  W*  n  (An  * 


an+l») 


The  optimum  coefficients  |Wn)  for  maximum  d  follow 
immediately  from  (C-7)  as 


( C-7 ) 


Wn  =  a  n  (An  -  An+1)  for  1  <  n  <  N  ,  (a  arbitrary) 


( C-8 ) 


giving 


j2  _  r — ■  2  ,  .  .  .2 

max  A--.‘  ^  n  ^n+1^ 

n=  1 


(C-9) 


The  optimum  weights  for  maximum  deflection  are,  from  (C-3) 
and  (C-8)  (with  AQ  s  0), 

*n  ■  "  5n  -  <n-1'  Vl  *  a(n2<an  '  '  V)  ' 

-  a([n2+(n-l)2]An  -  n2An+1  -  (n-l)2^)  for  1  S  n  1  N  .  (C-10) 


Finally,  we  can  scale  everything  by  choice  of  a  so  that  =  1, 
without  loss  of  generality;  then,  =  1. 
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APPENDIX  D.  CHARACTERISTIC  FUNCTION  OF  THE  SUM  OF  THE  DISTORTED 
M  LARGEST  RANDOM  VARIABLES  OF  AN  INDEPENDENT  SET  OF  SIZE  N 

Real  random  variables  (xn),  1  i  n  1  N,  are  independent  and 
identically  distributed,  with  arbitrary  probability  density 
function  p,  cumulative  distribution  function  C,  and  exceedance 
distribution  function  E.  We  order  this  original  set  of  random 
variables  into  a  new  set  (x^j,  where 

xi  *  *2  2  2  *N  •  (D-1) 

This  ordered  set  of  random  variables  is  non-Gaussian,  heavily 
statistically  dependent,  and  not  identically  distributed. 

We  select  the  initial  M  random  variables  of  the  ordered  set 
fx^J,  that  is,  the  M  largest  random  variables  of  original  set 
fxn),  and  subject  them  to  the  common  arbitrary  memoryless 
nonlinear  transformation  h  (which  could  be  complex).  We  then  sum 
these  M  distorted  random  variables,  obtaining  the  output  random 
variable  s  of  interest: 


M 

s  =  £Z!  h(xA>  •  (D-2, 

n=l  n 

We  are  interested  in  obtaining  the  exact  characteristic  function 
of  s,  for  general  N,  M,  h,  and  p,  despite  the  deleterious 
statistical  properties  of  the  ordered  set  { ,  that  were  noted 
under  (D-l).  In  particular,  we  want  the  statistical  average 

fs(£)  =  exp(iSs)  =  expfiCh(x^)  +  •••  +  iShfx^)]  .  (D-3) 
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From  (A-4),  the  joint  probability  density  function  of  the  M 
largest  random  variables  of  set  fxn)  is  given  by 


g(u, 


'UM> 


=  F  p(u.) 


P(UM>  C(um) 


N-M 


M 1 


( D-4  ) 


for  i.  u2  £  £  uM,  where  F  =  N(N-1 )  •  •  •  (N+l-M) .  Therefore, 

average  (D-3)  can  be  expressed  as  the  multiple  integral 


fg(S)  -  F  J  duM  exp[ iSh(uM) ]  P(uM)  C(uM)N  M  x 


X  J  duM- 1  exp[iCh(uM_1) ]  P(uM_1) 


x  • 


u 


M 


x  • 


J  du2  exp[ii;h{u2)  ]  P(u2)  J  du1  exp[i^h(u1)]  p(u1)  .  (D-5) 


u. 


u. 


In  order  to  simplify  this  multiple  integral,  define  function 


E(u;£)  s  J  dx  exp[i£h(x) ]  p(x)  .  (D-6) 

u 

This  integral  is  presumed  convergent  for  all  u  where  p  is  non¬ 
zero.  Special  cases  of  (D-6)  are  E(“>;£)  =  0,  and  E(u;0)  =  E(u), 
which  is  the  ordinary  exceedance  distribution  function  of 
original  set  (xnl*  Also,  E(-®,£)  is  the  characteristic  function 
of  the  output  of  nonlinear  device  h  subject  to  a  random  input 
with  probability  density  function  p.  Thus,  E(u;£)  is  a  mixture 
of  an  exceedance  distribution  and  a  characteristic  function. 
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Holding  parameter  £  fixed,  the  derivative  of  E(u;t)  with 
respect  to  u  is  denoted  by  a  prime,  getting 

E'(u;£)  =  gjj  E(u;£)  =  -  exp[Uh(u)]  p(u)  .  (D-7) 

Now,  denote  the  general  integral  on  um  in  (D-5)  as  Im*  Then,  we 
immediately  have  Ix  =  E(u2;t).  Proceeding  to  the  integral  on  u2 
in  (D-5),  we  can  develop  it  as 

00 

I2  *  J  du2  exp[ i£h(u2 ) ]  p(u2)  E(u2;l)  « 
u3 


=  -  J  du2  E'(u2;J;)  E(u 2;l)  =  \  E(u3;l)2  .  (D-8) 

u3 

Continuing  in  this  fashion,  one  integral  at  a  time  in  (D-5),  we 
arrive  at  the  result  for  the  integral,  namely 


Finally,  the  last  integral  on  uM  in  (D-5)  can  be  expressed  as 
00 

fsU)  -  M  (JJ)  J  du  exp[Uh(u)J  p(u)  C(u)N_M  E(u^)M_1  ,  (D-10) 
—  00 

where  we  have  simplified  the  leading  constant  F/(M-1)!.  This 
result  in  (D-10)  holds  for  N  1  M.  It  is  a  single  integral  for 
the  characteristic  function  of  sum  random  variable  s  defined  in 
(D-2).  If  N  >  M,  we  can  integrate  by  parts  on  (D-10),  using 
(D-7),  to  obtain  the  alternative 
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fs(£)  =  (N-M)  (JJ)  J  du  p(u)  C(u)N"M_1  E(u;?;)M  .  (D-ll) 

—  00 

It  should  be  noted  that  unequal  weights  {wnJ  in  sum  (D-2)  are 
strictly  disallowed  in  the  current  analysis.  The  simplification 
in  (D-8)  and  (D-9)  occurs  only  when  exactly  the  same  h  function 
appears  in  the  successive  integrals  on  u^  through  uM_^.  A 
generalization  that  includes  weights  in  (D-2)  is  only  possible 
for  very  special  probability  density  functions  and  nonlinear 
transformations.  One  such  case  is  exponential  p  and  linear  h. 

Several  checks  on  these  general  results  are  possible.  For 
£  =  0,  we  have  from  (D-ll)  and  (D-6), 

00 

fs(0)  =  (N-M)  (”)  J  du  p(u)  C(U)N~M_1  E(u)M  = 

—  oo 

=  (N-M)  (JJ)  |  dx  xN"M-1  (1  -  X)M  =  1  ,  (D-12) 

0 

using  [5;  8.380  1  and  8.384  1]. 

On  the  other  hand,  for  M  =  N,  we  use  (D-10),  (D-6),  and  (D-7) 
to  obtain 

00 

fs(Z)  =  N  J  du  [-E'  (u;fc)  ]  E(u;?;)N"1  =  E(-°°;£)N  = 

—  OO 

_ N 

=  [J  dx  exp[  i?(h(  x )  ]  p(x)j  =  exp[  i£;h(  xn )  ]  .  (D-13) 
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This  latter  expression  is  recognized  as  the  characteristic 
function  of  the  sum  s,  when  it  is  observed  that,  for  M  *  N, 

M=N  N 

S  =  HZ  h(x')  *  HZ  h(0  /  (D-14  ) 

n=l  n  n=l  n 

and  that  original  set  (xn)  is  composed  of  independent  identically 
distributed  random  variables  with  probability  density  function  p. 
Finally,  for  M  =  1,  (D-10)  and  (D-7)  yield 

00 

fsU)  =  N  |  du  exp[Uh(u)]  p(u)  C(u)N_1  = 

—  00 

00 

=  |  du  exp[Hh(u)]  ^  C(u)N  =  exp[Hh(x'  )  ]  .  (D-15) 

—  CO 

The  last  step  in  (D-15)  is  accomplished  by  observing  that  the 
cumulative  distribution  function  of  the  maximum,  x£,  of  set  |xn) 
is  just  C(u)  .  And,  for  M  =  1,  (D-2)  reduces  to  s  *  h(x^). 

MEAN  OF  SUM  s  IN  (D-2) 

In  order  to  find  the  mean  of  the  random  variable  s  defined  by 
(D-2),  we  begin  by  expanding  (D-6)  in  a  series  about  l  ~  0: 

E(u;t)  ~  E ( u )  +  U  H(u)  as  Z,  -*  0  ,  (D-16) 

where 

00 

H(u)  h  J  dx  h(x)  p(x)  .  ( D-17 ) 

u 
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Then,  for  small  t,  ( D— 11)  yields 


fgU)  =  (N-M)  (JJ)  |  du  p(u)  C(u)N-M_1  [E(u)M  +  i£M  E(u)M_1  H  ( u )  ]  = 

—  00 

GO 

*  1  +  U  M  (N-M)  (Jj)  |  du  p(u)  C(u)N_M*1  E(u)M~l  H(u)  ,  (D-18) 


upon  use  of  (D-12).  Therefore,  the  mean  of  sum  s  is  given  by  the 
single  integral 


Ps  -  M  (N-M)  (Jj)  J  du  p(U)  C(U)N  M_1  E(u)M  1  H  ( u )  .  (D-19 ) 


A  similar  expansion  of  E(u;£)  and  f  (£)  to  second  order  gives  the 

5 

second  moment  of  s  in  the  form 


M  (N-M)  [Jj]  J  du  p  ( u )  Ctu)1*"”"1  E(u  )M-2  |e  ( u )  H(u)  +  (M-l)  H(u)2] 


( D-20 ) 


where 


H(u)  =  J  dx  h(x)2  p ( x )  . 


( D-2 1 ) 
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EXAMPLE  1:  h(x)  “x,  p(u)  =  exp(-u)  for  u  >  0. 


According  to  (D-2),  this  corresponds  to 


S  —  \  9 

n 


( D-22 ) 


which  is  the  sum  of  the  M  largest  exponential  random  variables. 
Then,  (D-6 )  yields  E(u;£)  =  exp( -u( 1-ifc) )/( 1-iS)  for  u  >  0,  and 
(D-ll)  becomes,  for  N  >  M, 


fs(E)  -  (N-M)  («)  — j  du  e‘“  (l-e-u) 


N-M-l 


-u(l-i£|M  _ 


=  (N)  _HzM  f  dx  (I_X)N-M-1  X(1-H)M  . 

lM)  <1-U)M  i 


-  [(1  -  i£)M  FT  (l  -  «*)]  . 

1  n=M+ 1 '  ;J 


(D-23) 


upon  letting  x  =  e  ,  and  using  [5;  8.380  1  and  8.384  1],  This 
result  agrees  with  (A-ll). 


EXAMPLE  2:  h(x)  =  x,  p(u)  =  exp{-u)  for  u  >  0. 


This  corresponds  to  the  sum  of  the  squares  of  the  M  largest 
exponential  random  variables.  Now,  (D-6)  yields  [7;  chapter  7) 


E(u; £)  =  J  dx  exp(  i£x2  -  x)  =  expjij  +  i£u2  -  uj  v;(a  +  i(3) 


( D-24 ) 
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for  u  >  0  and  £  >  0,  where  real  quantities 


It  should  be  observed/  that  since  l  >  0  in  (D-24),  and  u  >  0  in 
integral  (D-ll)  for  this  example/  then  p  >  0  in  (D-25);  this 
means  that  the  real  part  of  w  in  (D-24)  is  always  positive 
[7;  7.4.13].  This  fact  can  be  used  to  simplify  the  calculation 
of  the  argument  of  w,  which  is  needed  for  (D-ll).  The  final 
result  for  the  characteristic  function  of  sum  s  follows  from 
(D-ll)  in  the  integral  form 

00  1 

fsU)  =  (N-M)  (Jj)  J  du  e~u  (l  -  e~u)  E(u;£)M  ,  (D-26) 

0 

which  must  be  done  numerically,  by  means  of  (D-24). 

EXAMPLE  3:  h(x)  =  x,  p(u)  =  $(u),  C(u)  =  $(u)  for  all  u. 

This  case  corresponds  to  the  sum  of  the  M  largest  normalized 

_ic  2 

Gaussian  random  variables;  here,  $(u)  =  (2n)  exp(-u  /2)  for  all 
u.  In  this  case,  (D-6)  yields,  for  all  u, 

00  2 
E(u;£)  =  J  dx  exp(Hx)  4><x)  =  exp[ii;u  -  y-)  w(^  /21U)  *  (D-27) 

u 

The  corresponding  characteristic  function  of  sum  s  is,  from 

(D-ll), 
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fs(S)  -  (N-M)  (jj)  J  du  ♦  ( u )  *(u)N_M_1  E(u; £ )M  .  ( D-28 ) 

—  00 

Extension  to  a  general  mean  m  and  standard  deviation  a  for 
original  set  {*nl  is  easily  accomplished;  the  result  is 
exp( i£mM)  fs(£o). 

EXAMPLE  4:  h(x)  =  x^,  p(u)  =  $(u)  for  all  u. 

This  example  corresponds  to  the  sum  of  the  squares  of  the  M 
largest  normalized  Gaussian  random  variables.  Now,  (D-6)  yields, 
for  all  u, 

00 

E(u;  £; )  =  J  dx  exp(  i£x2  )  <}>(x)  « 
u 

2 

=  - - - r  exp f-  f-  (1  -  U2)}  wfu  -tj  ( 1  -  U2)**)  .  (D-29) 

2(1  -  H2r  1  )  {  VZ  ) 

The  characteristic  function  of  sum  s  is  now  given  by  (D-ll)  as 

ao 

fs(l)  =  (N-M)  [JJ)  J  du  4>(U)  ^(U)^"1  E(u;£)M  .  (D-30) 

—  00 

This  result  is  easily  extended  to  a  standard  deviation  a  ^  1  for 

2 

original  set  {x  ),  namely  f  ( So  );  however,  it  is  not  easily 

II  S 

extended  to  mean  m  ^  0. 
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The  last  three  examples  of  characteristic  functions  rely 
heavily  on  an  accurate  efficient  routine  for  calculating  the 
error  function  of  complex  argument,  w;  see  (D-24),  (D-27),  and 
(D-29).  By  contrast,  the  characteristic  function  result  in 
( D— 2 3 )  for  the  first  example  is  in  closed  form.  The  following 
example  will  demonstrate  a  possible  limitation  of  a  different 
kind. 


EXAMPLE  5:  h(x)  =  x,  p(u)  -  exp(-u)  uK-1/(K-l)!  for  u  >  0. 

As  in  ( D-22 ) ,  this  is  the  sum  of  the  M  largest  random 
variables  but  where  the  original  random  variables  {*nl  are  now 
chi-squared  with  2K  degrees  of  freedom.  Then,  (D-6)  yields 
closed  form 

E(u;  £)  =  exP(~u( 1-^)  i  >  ‘  - — for  u  >  0  .  (D-31) 

( 1-i?; ) K  k=0 

The  cumulative  distribution  function  is 


K-l  k 

C(u)  =  1  -  E(u;0)  =  1  -  exp(-u)  >  '  7-7  for  u  >  0  . 

k=0 


( D-32 ) 


Substitution  of  these  results  in  (D-ll)  yields  the  characteristic 
function  of  sum  s  in  the  form 


K-l 


fS<£>  =  <N-M>  (h)  I  dU  e'U  T^TT! 


1  -  e 


K-l  k) 

-U  r - -  U 

L - -  ),i 

k=0 


N-M-l 


exp(-u( l-i^) )  r — •  u  (l-i£) 

K  *  k 1 

(1-U)K  k=0  K* 


M 


(D -33) 
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which  must  be  done  numerically  for  each  K  of  interest.  (For  K=l, 
(D-33)  reduces  to  ( D—  23).)  The  latter  sum  on  k  in  (D-33)  is  an 
alternating  one,  which  could  be  troublesome  for  large  u  and/or  l. 
A  possible  alternative  is  to  directly  evaluate  integral  (D-6) 
numerically;  it  is  a  Fourier  transform  for  this  example. 


MEANS  OF  SUM  s  FOR  FOUR  EXAMPLES 

The  mean  of  sum  s  defined  in  (D-2)  was  given  generally  by 
integral  (D-19),  in  conjunction  with  (D-17).  For  example  1,  we 
find  H(u)  =  e-u  (1+u)  for  u  >  0,  and  =  M  where 

N  i 

Y  =  1  +  XZ!  -£■  for  k  >  1  .  ( D-34 ) 

K  n=M+l  nK 

More  generally,  the  k-th  cumulant  of  sum  s  for  this  example, 

X  (k),  is  given  in  (A- 13). 

For  example  2,  there  follows  H(u)  =  2  e-u  (1  +  u  +  u^/2)  for 

2 

u  >  0,  and  =  Mfy^^  +  )  /  where  we  used  (A-17)  -  (A-21).  The 

results  for  mean  in  both  of  these  examples,  involve  only  sums 
of  positive  terms;  see  (D-34). 

For  example  3,  we  have 

00 

H(u)  =  J  dx  x  «)»(x)  =  <j>(u)  for  all  u  ,  (D-35) 

u 

thereby  giving 
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-  M  (N-M)  (JJ)  J  du  *(u)2  *(u)N'M_1  [1  -  *(u)]M_1  ,  (D-36) 

—  00 

which  must  be  done  numerically. 

For  example  4,  H(u)  =  u  ${u)  +  1  -  $(u),  yielding 

00 

*  M  +  M  (N-M)  (Jj)  J  du  u  Hu)2  *(u)N_M_1  [1  -  $(u)]M_1.  (D-37) 

—  00 

Finally,  for  example  5,  we  have 

K  k 

H(u)  =  K  exp(-u)  >  £7  for  u  >  0  ,  (D-38) 

k=0  K* 

with  mean 
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APPENDIX  E.  RECEIVER  OPERATING  CHARACTERISTICS 
FOR  SUM-OF-M-LARGEST  PROCESSOR 


The  sum-of-M-largest  (SOHL)  processor  is  characterized 


according  to  (28)  by  the  summation 


■  r  i'  v  . 
n^T  n  < 


(E-l) 


Here,  random  variables  {x^}  are  the  ordered  version  of  given  data 
{xn}  for  1  i.  n  £  N.  Summation  limit  M  is  a  hypothesized  or 
assumed  value  for  the  number  of  bins  occupied  by  signal.  The 
actual  number  of  occupied  bins  is  M.  The  following  receiver 
operating  characteristics  (ROC)  are  plotted  for  N  =  1024,  and  for 
M  and  M  taking  on  the  values  listed  in  (34)  and  (35), 
respectively.  The  quantity  S(dB)  is  the  common  value  of  the 
signal  power  per  bin  in  dB.  Since  the  noise  power  per  bin  has 
been  normalized  at  unity,  S  is  also  the  signal-to-noise  ratio  per 
bin.  The  number  of  trials  utilized  for  the  detection  probability 
in  each  case  is  noted  on  the  individual  figure.  The  false  alarm 
probability  was  determined  exactly  from  characteristic  function 
(29).  The  bottom-most  straight  line  at  45°  corresponds  to  S  =  0, 
that  is,  S(dB)  *  -®  dB;  it  lies  along  P^  =  P^. 


87 


TR  10739 


89 


TR  10739 


uoi^oa^ag  >o  /<}  i  (  i  qsqojj 


rs 

ro 

II 

£ 


II 

SI 

.3 

X 

o 

in 

u 

o 

vw 

u 

o 

IX 


vO 

I 

w 

QJ 

U 

3 

cr> 

•H 

IX 


vO 

r-H 

II 

X 


II 

2:1 


3 

X 

o 

cn 

u 

o 

v*-l 

u 

o 

ex 


un 

l 

w 

<u 

x 

3 

O' 

*H 

IX 


90 


TR  10739 


uoi%38%ag  Xq  1 1 1 qeqojj 


uoi^oaisg  j.o  XitiiqsqoJcj 


91 


E-«  .001  .01.02  .05  .1  .2  .3  .4  .5  E-6  E-5  E-4  .001  .01.02 


TR  10739 


uoi^aa%8Q  j.0  1 1  t  qeqojj 


92 


Figure  E-9.  ROC  for  SOML 
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Figure  E-57.  ROC  for  SOML,  M=16,  M=64  Figure  E-58.  ROC  for  SOML,  M=16,  M=128 
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Figure  E-81.  ROC  for  SOML,  M=128,  M=2  Figure  E-82.  ROC  for  SOML,  M=128 


TR  10739 


129 


TR  10739 


E 

L. 

to 

E 

» 

M 

id 

U. 


>s 

♦J 


.o 

Id 

.0 

O 

(- 

Q- 


uoi;os;sq  *o  XlH*°iBCl0Jd 


vO 

H 

n 

z 


oo 

CM 


►J 

2 

O 

cn 

1-4 

o 

4-1 


U 

o 

02 


m 

oo 

I 

« 

<1) 

l-i 

3 

CP 

*H 

&4 


130 


TR  10739 


131 


TR  10739 


132 


TR  10739 


uoi*oa*og  j.o  A.%  i  i  | qeqojd 


rn 

II 

2 


lO 

m 

CN 


II 

SI 


2 

o 

to 

1-t 

o 

4-1 


u 

o 

K 


CN 

Ov 

I 

W 

<u 

l-l 

3 

O' 

•H 


U0|}30*3g  i°  lcla90Jd 


CN 

II 

2 


iO 

in 

CN 


II 


21 


►J 

2 

O 


C/5 

l~i 

O 

4-1 


u 

o 

OS 


cn 

I 

w 

Q) 

U 

3 

CT> 

•H 

C*H 


133 


TR  10739 


uoi^oo^oq  j-o  /<% 1 1 1 qsqojjj 


134 


TR  10739 


uonoa^aQ  jo  A^iiiqsqojd 


VO 

H 

S 


VO 

Lf> 

CM 


II 


SI 


J 

s 

o 

CO 

o 

vw 


u 

o 

os 


in 

O' 

I 

a 

<u 

3 

O' 

■H 

(X4 


135 


TR  10739 


136 


TR  10739 


137 


TR  10739 


uonae^oQ  Ximqaqojd 


uou30*»G  ^0  X*n»c!*cl0Jd 


138 


TR  10739 


139 


TR  10739 


CM 

ro 

II 

X 


* 

CM 

rH 

Ld 

II 

X\ 


X 

o 


to 

u 

o 

M-l 


u 

o 

os 


v£> 

O 

t-H 

i 

w 

<u 

Im 

D 

0> 

b 


140 


TR  10739 


141 


TR  10739 


142 


TR  10739 


143 


TR  10739 


144 


TR  10739 


145 


TR  10739 


146 


TR  10739 


147/148 
Reverse  Blank 


TR  10739 


APPENDIX  F.  ON  THE  CHOICE  OF  M  WHEN  M  IS  KNOWN 

The  total  number  of  search  bins  is  N,  and  the  number  of 
occupied  signal  bins  (when  signal  is  present)  is  M,  which  is 
known.  However,  the  locations  L  of  these  M  bins  are  totally 
unknown.  Under  both  hypotheses  HQ  and  H^  the  average  noise 
power  in  all  N  bins  is  1;  under  H^,  the  average  signal  power  in 
the  M  occupied  signal  bins  is  common  value  S,  which  is  unknown. 

The  GLR  test  is  to  sum  the  M  largest  values  of  the  input  data 
{xnJ  and  compare  the  sum  with  a  threshold.  This  is  equivalent  to 
ordering  the  input  data  into  the  set  {x^},  where  >  x^+1  for 
1  £  n  £  N-l,  summing  the  first  M  values,  and  comparing  the  sum 
with  the  same  threshold.  Although  this  test  appears  to  agree 
with  physical  intuition,  simulations  reveal  that  better 
performance,  in  terms  of  Pd  versus  Pf,  are  obtained  if  the 
largest  M  bin  outputs  are  summed,  where  M  is  sometimes  different 
from  M.  This  means  that  the  GLR  test  is  not  optimum  in  this 
situation;  of  course,  the  GLR  procedure  makes  no  claim  for 
optimality,  although  it  frequently  leads  to  a  high  quality  test. 
The  explanation  and  remedy  to  this  apparent  discrepancy  follows. 

Under  Hq,  the  average  level  of  xn  is  1.  On  the  other  hand, 
under  H^,  the  average  level  of  xn  is  changed  to  1+S  in  the  M 
occupied  signal  bins,  but  only  for  these  M  bins  which  contain 
signal.  The  remaining  N-M  bins  still  have  average  level  1  under 
H^ .  Unfortunately,  since  the  information  about  which  particular 
bins  are  occupied  is  unknown,  we  are  led  to  consider  the  ordered 
data  {x^},  which  contain  identical  information  to  {xnJ  under  this 
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situation.  Even  if  we  ignore  any  guidance  from  the  GLR 
procedure,  a  justification  for  considering  ordered  data  is  that 
the  presence  of  signal  results  in  M  larger  data  values  on 
average,  and  therefore,  larger  values  should  get  more  weight. 

For  the  ordered  data  {x^},  the  situation  regarding  average 
values  is  somewhat  different.  Under  HQ,  the  average  level  of  x^ 
decreases  monotonically  with  n.  When  signal  is  added  to  M  bins 
under  and  the  data  { xn |  ordered,  the  average  levels  of  all  N 
data  values  x^  are  increased,  not  just  the  first  M  bins.  In 
order  to  demonstrate  this  claim,  let  the  first  M  bins  contain  the 
signal;  this  is  no  loss  of  generality,  since  we  are  not  going  to 
use  this  fact  in  our  data  processing.  Now,  under  H^,  the  act  of 
ordering  the  measured  data  {xnJ  evicts  some  of  the  smaller  signal 
members  from  their  initial  locations  in  set  [1,M]  into  new 
quarters  in  the  set  of  numbers  [M+1,N]  in  data  sequence  {x^j . 
Thus,  when  is  prevalent,  the  signal  addition  and  bin  movement 
raises  the  average  levels  of  {x^},  not  just  for  1  <  n  <  M,  but 
also  for  some  n  values  larger  than  M.  Some  examples  have  shown 
increases  all  the  way  out  to  n  =  N. 

For  improved  performance,  we  must  be  willing  to  look  for 
average  deflections  in  any  and  all  bins.  Therefore,  even  though 
we  may  know  that  only  M  bins  are  occupied  by  signal,  we  must 
process  M  bins  of  the  ordered  data  {x^{,  where  M  can  be  larger 
(or  smaller)  than  M,  due  to  the  spillover  effect  described  above. 
In  order  to  settle  on  near-optimum  values  of  M  to  use,  it  is 
necessary  to  conduct  simulations  of  the  receiver  operating 
characteristics  for  various  combinations  of  N,  S,  M,  and  M. 
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